Pf^'T WORLD INTELLECTUAL PROPERTY ORGANIZATION xSaSb' 

x ^ A International Bureau 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7 : 

C12N 15/12, C07K 14/51, 14/495, C12N 


A1 


(11) International Publication Number: 


WO 00/32773 


15/63, 5/10, C07K 16/22, C12Q 1/68, 
C12N 15/62, A61K 38/18, A61P 19/10, 
G01N 33/53, A01K 67/027 




(43) International Publication Date: 


8 June 2000 (08.06.00) 



(21) International Application Number: PCT/US99/27990 



(22) International Filing Date: 24 November 1999 (24. 11. 99) 



(30) Priority Data: 

60/1 10,283 27 November 1998 (27.1 1.98) US 



(71) Applicant (for all designated States except US): DARWIN 

DISCOVERY LTD. [GB/GB]; Cambridge Science Park, 
Milton Road, Cambridge, Cambridgeshire CB4 4WE (GB). 

(72) Inventors; and 

(75) Inveritors/Applicants (for US only): BRUNKOW, Mary, E. 
[US/US]; 9829 Triton Drive NW, Seattle, WA 98 117 (US). 
GALAS, David, J. [US/US]; 854 Guanajuato Drive, Clare- 
mont, CA 91711 (US). K.OVACEVICH, Brian [US/US]; 
4308 N.E 6th Place, Renton, WA 98059 (US). MULLI- 
GAN, John, T. [US/US]; 5823 17th Avenue Northeast, Seat- 
tle, WA 98105 (US). PAEPER, Bryan, W. [US/USJ; 1617 
Summit Avenue #43, Seattle, WA 98122 (US). VAN NESS, 
Jeffrey [US/US]; 10020 49th Avenue Northeast, Seattle, WA 
98125 (US). WINKLER, David, G. [US/US]; 7037 20th Av- 
enue NE, Seattle, WA 981 15 (US). 



(74) Agent: MCMASTERS, David, D.; Seed and Berry LLP, Suite 
6300, 701 Fifth Avenue, Seattle, WA 98104-7092 (US). 



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA, BB, BG, 
BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK, DM, EE, 
ES. FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, IS. JP, 
KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MA, 
MD, MG. MK, MN, MW, MX, NO, NZ, PL, PT, RO, RU, 
SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA, UG, 
US, UZ, VN, YU, ZA, ZW, ARIPO patent (GH, GM, KE, 
LS, MW, SD, SL, SZ, TZ, UG, ZW), Eurasian patent (AM, 
AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent (AT, 
BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, 
MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CL CM. 
GA, GN, GW, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: COMPOSITIONS AND METHODS FOR INCREASING BONE MINERALIZATION 



(57) Abstract 

A novel class or family of TGF -(3 bind- 
ing proteins is disclosed. Also disclosed are 
assays for selecting molecules for increasing 
bone mineralization and methods for utilizing 
such molecules. 



Common Cysteine Backbone 



i 

human qreialin . pro 
human__cerberus .pro 
hunmn_.dan.pro 
huraan_.be er . p/o 



humi n_g renl i n . pr o 
hums n_ ce rbe ru o . pro 
huoan_dan . pro 
hutnan_beer .pro 



htxnan_gromlin .pro 
human_eerberua .pro 
hut*an_dan . pro 
human__beer .pro 



hu3\an_grcmlin . pro 
huinan_carberua . pro 

hunan_dan.pro 
huoan_bcer . pro 



hujnan_gretnlin'. pro 
human_cerberua . pro 
human_dan . pro 
human_bcar.pro 



huwan — gremlin . pro 
human_cerberu» . pro 
huaai!_dan. pro 
hunanjbeer.pro 



50 

MHLLLPQLLiV LLPLGKTTRH QDORQUQSSL SPVLLPRNQR ELPTONHSEA 



51 10° 

M SRTAYTVOAL LLLLGTLLPA AEGKKKOSOO 

BEKPDLPVAV PH L VAT . SPA OEOQRQRSKN LSRFGRTWKK PffREMHPSRD 



-- MQLPLA LCLVCLLVHT 

101 150 

AI.PPPDKAQ HHDSSQTOSP OO POSH HR OR GOOROTAKPO EKVLEflQQKA 
SDSBPFPPQT QSLIQPID.O MKMSKSPLRE EAKKPWKHFH PRKTPABQOV 

; KLRVLVOAVL PAMliUVAPPP 

APRWRCQOW QAFKMDATBt IPEU2EYPHP PPELESHtTH HBACMCQRP7 

151 \|/ ^ ^ aoo 

LHVTERKYLK RPWCKTQPLK OTIHEBOCNS RTIXNRF.CY GQCWSFYIPR 
TbPIKSHHVH HHTCRTVPPS QTTTHEGCEX VWONNL.CF GKCQSVHPP. 
INKLALFFCK 8AWCEAKNIT 01VCHS0CEA KS1QNRA. CL GQCFSYSVPN 
HHPPETKDVS 3Y5CRBLHPT RYVTOGPCRS AKPVTBIjVCS GQCOPARLLP 

231 'if ^ 250 

HIRK5EQSF0 SCSP...CKP KKFTTNMVTL NCPBLQPPTK K.KRVTRVKQ 
. . OAAQH8HT SCSH...CLP AKFTTNHLPL NCTKL.SSVIK V. , . VHLVE8 
TPPOSTESLV HCDfl . . . CMP AQSKWEXVTL BCPGHBffVPR VDKLVEKItil 
NAIOROWHWR PSOPDFRCIP DRYRAQRVQi. LCPOOBAPRA RKVRLVAS . . 

200 

CftC.ISIDLD ....... 

COCXVKTEHE D0H1LHAGS0 DSP1P0VSA- — 

CSCQACOKEP 8HKQLSVYV0 GEOGPOSQPO THPHPHPHPH PCUQTPEPED 
CXCKRLTH FH HQflEWCDFOT CAARPQKORK FRPRARSARA MQABLEHAY- 



201 314 

hmoan grenlin. pro — .... — 

hu-rcan_cerberuB . pro — -- — — .. 

huo*n w dan.pro PPOAPHTEEE CARD 
hunvm_bcer . pro .... 





FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


. ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


Kl 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BC 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TJ 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CC 


Congo 


KB 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


Cl 


C&e d’Ivoire 


KP 


Democratic People’s 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


U 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SB 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 








WO 00/32773 



PCT/US99/27990 



COMPOSITIONS AND METHODS FOR INCREASING BONE MINERALIZATION 
TECHNICAL FIELD 

The present invention relates generally to pharmaceutical products and 
5 methods and, more specifically, to methods and compositions suitable for increasing 
the mineral content of bone. Such compositions and methods may be utilized to treat a 
wide variety of conditions, including for example, osteopenia, osteoporosis, fractures 
and other disorders in which low bone mineral density are a hallmark of the disease. 

BACKGROUND OF THE INVENTION 

10 Two or three distinct phases of changes to bone mass occur over the life 

of an individual (see Riggs, West 1 Med 1 54:63-17, 1991). The first phase occurs in 
both men and women, and proceeds to attainment of a peak bone mass. This first phase 
is achieved through linear growth of the endochondral growth plates, and radial growth 
due to a rate of periosteal apposition. The second phase begins around age 30 for 
15 trabecular bone (flat bones such as the vertebrae and pelvis) and about age 40 for 
cortical bone ( e.g ., long bones found in the limbs) and continues to old age. This phase 
is characterized by slow bone loss, and occurs in both men and women. In women, a 
third phase of bone loss also occurs, most likely due to postmenopausal estrogen 
deficiencies. During this phase alone, women may lose an additional 10% of bone 
20 mass from the cortical bone and 25% from the trabecular compartment (see Riggs, 
supra). 

Loss of bone mineral content can be caused by a wide variety of 
conditions, and may result in significant medical problems. For example, osteoporosis 
is a debilitating disease in humans characterized by marked decreases in skeletal bone 
25 mass and mineral density, structural deterioration of bone including degradation of 
bone microarchitecture and corresponding increases in bone fragility and susceptibility 
to fracture in afflicted individuals. Osteoporosis in humans is preceded by clinical 
osteopenia (bone mineral density that, is greater than one standard deviation but less 
than 2.5 standard deviations below the mean value for young adult bone), a condition 
30 found in approximately 25 million people in the United States. Another 7-8 million 
patients in the United States have been diagnosed with clinical osteoporosis (defined as 
bone mineral content greater than 2.5 standard deviations below that of mature young 
adult bone). Osteoporosis is one of the most expensive diseases for the health care 
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system, costing tens of billions of dollars annually in the United States. In addition to 
health care-related costs, long-term residential care and lost working days add to the 
financial and social costs of this disease. Worldwide approximately 75 million people 
are at risk for osteoporosis. 

5 The frequency of osteoporosis in the human population increases with 

age, and among Caucasians is predominant in women (who comprise 80% of the 
osteoporosis patient pool in the United States). The increased fragility and 
susceptibility to fracture of skeletal bone in the aged is aggravated by the greater risk of 
accidental falls in this population. More than 1.5 million osteoporosis-related bone 
10 fractures are reported in the United States each year. Fractured hips, wrists, and 
vertebrae are among the most common injuries associated with osteoporosis. Hip 
fractures in particular are extremely uncomfortable and expensive for the patient, and 
for women correlate with high rates of mortality and morbidity. 

Although osteoporosis has been defined as an increase in the risk of 
15 fracture due to decreased bone mass, none of the presently available treatments for 
skeletal disorders can substantially increase the bone density of adults. There is a 
strong perception among all physicians that drugs are needed which could increase 
bone density in adults, particularly in the bones of the wrist, spinal column and hip that 
are at risk in osteopenia and osteoporosis. 

20 Current strategies for the prevention of osteoporosis may offer some 

benefit to individuals but cannot ensure resolution of the disease. These strategies 
include moderating physical activity (particularly in weight-bearing activities) with the 
onset of advanced age, including adequate calcium in the diet, and avoiding 
consumption of products containing alcohol or tobacco. For patients presenting with 
25 clinical osteopenia or osteoporosis, all current therapeutic drugs and strategies are 
directed to reducing further loss of bone mass by inhibiting the process of bone 
absorption, a natural component of the bone remodeling process that occurs 
constitutively. 

For example, estrogen is now being prescribed to retard bone loss. 
30 There is, however, some controversy over whether there is any long term benefit to 
patients and whether there is any effect at all on patients over 75 years old. Moreover, 
use of estrogen is believed to increase the risk of breast and endometrial cancer. 

High doses of dietary calcium, with or without vitamin D has also been 
suggested for postmenopausal women. However, high doses of calcium can often have 
35 unpleasant gastrointestinal side effects, and serum and urinary calcium levels must be 
continuously monitored (see Khosla and Rigss, Mayo Clin. Proc. 70: 978-982, 1995). 
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Other therapeutics which have been suggested include calcitonin, 
bisphosphonates, anabolic steroids and sodium fluoride. Such therapeutics however, 
have undesirable side effects (e.g., calcitonin and steroids may cause nausea and 
provoke an immune reaction, bisphosphonates and sodium fluoride may inhibit repair 
5 of fractures, even though bone density increases modestly) that may prevent their usage 
(see Khosla and Rigss, supra). 

No currently practiced therapeutic strategy involves a drug that 
stimulates or enhances the growth of new bone mass. The present invention provides 
compositions and methods which can be utilized to increase bone mineralization, and 
10 thus may be utilized to treat a wide variety of conditions where it is desired to increase 
bone mass. Further, the present invention provides other, related advantages. 

SUMMARY OF THE INVENTION 

As noted above, the present invention provides a novel class or family of 
TGF-beta binding-proteins, as well as assays for selecting compounds which increase 
15 bone mineral content and bone mineral density, compounds which increase bone 
mineral content and bone mineral density and methods for utilizing such compounds in 
the treatment or prevention of a wide variety of conditions. 

Within one aspect of the present invention, isolated nucleic acid 
molecules are provided, wherein said nucleic acid molecules are selected from the 
20 group consisting of: (a) an isolated nucleic acid molecule comprising sequence ID Nos. 
1, 5, 7, 9, 11, 13, or, 15, or complementary sequence thereof; (b) an isolated nucleic 
acid molecule that specifically hybridizes to the nucleic acid molecule of (a) under 
conditions of high stringency, and (c) an isolated nucleic acid that encodes a TGF-beta 
binding-protein according to (a) or (b). Within related aspects of the present invention, 
25 isolated nucleic acid molecules are provided based upon hybridization to only a portion 
of one of the above-identified sequences (e.g., for (a) hybridization may be to a probe 
of at least 20, 25, 50, or 100 nucleotides selected from nucleotides 156 to 539 or 555 to 
687 of Sequence ID No. 1). As should be readily evident, the necessary stringency to 
be utilized for hybridization may vary' based upon the size of the probe. For example, 
30 for a 25-mer probe high stringency conditions could include: 60 mM Tris pH 8.0, 2 
mM EDTA, 5x Denhardt's, 6x SSC, 0.1% (w/v) N-laurylsarcosine, 0.5% (w/v) NP-40 
(nonidet P-40) overnight at 45 degrees C, followed by two washes with with 0.2x SSC 
/ 0.1% SDS at 45-50 degrees. For a 100-mer probe under low stringency conditions, 
suitable conditions might include the following. 5x SSPE, 5x Denhardt's, and 0.5% 
35 SDS overnight at 42-50 degrees, followed by two washes with 2x SSPE (or 2x SSC) 
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/0. 1 % SDS at 42-50 degrees. 

Within related aspects of the present invention, isolated nucleic acid 
molecules are provided which have homology to Sequence ID Nos. 1, 5, 7, 9, 11, 13, or 
15, at a 50%, 60%, 75%, 80%, 90%, 95%, or 98% level of homology utilizing a 
5 Wilbur-Lipman algorithm. Representative examples of such isolated molecules 
include, for example, nucleic acid molecules which encode a protein comprising 
Sequence ID NOs. 2, 6, 10, 12, 14, or 16, or have homology to these sequences at a 
level of 50%, 60%, 75%, 80%, 90%, 95%, or 98% level of homology utilizing a 
Lipman-Pearson algorithm. 

10 Isolated nucleic acid molecules are typically less than lOOkb in size, and, 

w'ithin certain embodiments, less than 50kb, 25kb, lOkb, or even 5kb in size. Further, 
isolated nucleic acid molecules, within other embodiments, do not exist in a “library” 
of other unrelated nucleic acid molecules ( e.g a subclone BAC such as described in 
GenBank Accession No. AC003098 and EMB No. AQ1 71546). However, isolated 
15 nucleic acid molecules can be found in libraries of related molecules (e.g., for 
shuffling, such as is described in U.S. Patent Nos. 5,837,458; 5,830,721, and 
5,81 1,238). Finally, isolated nucleic acid molecules as described herein do not include 
nucleic acid molecules which encode Dan, Cerberus, Gremlin, or SCGF (U.S. Patent 
No. 5,780,263). 

20 Also provided by the present invention are cloning vectors which 

contain the above-noted nucleic acid molecules, and expression vectors which comprise 
a promoter (e.g., a regulatory sequence) operably linked to one of the above-noted 
nucleic acid molecules. Representative examples of suitable promoters include tissue- 
specific promoters, and viral - based promoters (e.g., CMV-based promoters such as 
25 CMV I-E, SV40 early promoter, and MuLV LTR), Expression vectors may also be 
based upon, or derived from viruses (e.g., a “viral vector”). Representative examples 
of viral vectors include herpes simplex viral vectors, adenoviral vectors, adenovirus- 
associated viral vectors and retroviral vectors. Also provided are host cells containing 
or comprising any of above-noted vectors (including for example, host cells of human, 
30 monkey, dog, rat, or mouse origin). 

Within other aspects of the present invention, methods of producing 
TGF-beta binding-proteins are provided, comprising the step of culturing the 
aforementioned host cell containing vector under conditions and for a time sufficient to 
produce the TGF-beta binding protein. Within further embodiments, the protein 
35 produced by this method may be further purified (e.g., by column chromatography, 
affinity purification, and the like). Hence, isolated proteins which are encoded by the 
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above-noted nucleic acid molecules ( e.g Sequence ID NOs. 2, 4, 6, 8, 10, 12, 14, or 
16) may be readily produced given the disclosure of the subject application. 

It should also be noted that the aforementioned proteins, or fragments 
thereof, may be produced as fusion proteins. For example, within one aspect fusion 
5 proteins are provided comprising a first polypeptide segment comprising a TGF-beta 
binding-protein encoded by a nucleic acid molecule as described above, or a portion 
thereof of at least 10, 20, 30, 50, or 100 amino acids in length, and a second 
polypeptide segment comprising a non-TGF-beta binding-protein. Within certain 
embodiments, the second polypeptide may be a tag suitable for purification or 
10 recognition (e g., a polypeptide comprising multiple anionic amino acid residues - see 
U.S. Patent No. 4,851,341), a marker (e.g., green fluorescent protein, or alkaline 
phosphatase), or a toxic molecule ( e.g ricin). 

Within another aspect of the present invention, antibodies are provided 
which are capable of specifically binding the above-described class of TGF-beta 
15 binding proteins ( e.g ., human BEER). Within various embodiments, the antibody may 
be a polyclonal antibody, or a monoclonal antibody {e.g., of human or murine origin). 
Within further embodiments, the antibody is a fragment of an antibody which retains 
the binding characteristics of a whole antibody {e.g., an F(ab’) 2 , F(ab) 2 , Fab’, Fab, or 
Fv fragment, or even a CDR). Also provided are hybridomas and other cells which are 
20 capable of producing or expressing the aforementioned antibodies. 

Within related aspects of the invention, methods are provided detecting a 
TGF-beta binding protein, comprising the steps of incubating an antibody as described 
above under conditions and for a time sufficient to permit said antibody to bind to a 
TGF-beta binding protein, and detecting the binding. Within various embodiments the 
25 antibody may be bound to a solid support to facilitate washing or separation, and/or 
labeled, {e.g., with a marker selected from the group consisting of enzymes, fluorescent 
proteins, and radioisotopes). 

Within other aspects of the present invention, isolated oligonucleotides 
are provided which hybridize to a nucleic acid molecule according to Sequence ID 
30 NOs. 1, 3, 5, 7, 9, U, 13, 15, 17, or 18 or the complement thereto, under conditions of 
high stringency. Within further embodiments, the oligonucleotide may be found in the 
sequence which encodes Sequence ID Nos. 2, 4, 6, 8, 10, 12, 14, or 16. Within certain 
embodiments, the oligonucleotide is at least 15, 20, 30, 50, or 100 nucleotides in 
length. Within further embodiments, the oligonucleotide is labeled with another 
35 molecule {e.g., an enzyme, fluorescent molecule, or radioisotope). Also provided are 
primers w'hich are capable of specifically amplifying all or a portion of the above- 
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mentioned nucleic acid molecules which encode TGF-beta binding-proteins. As 
utilized herein, the term “specifically amplifying” should be understood to refer to 
primers which amplify the aforementioned TGF-beta binding-proteins, and not other 
TGF-beta binding proteins such as Dan, Cerberus, Gremlin, or SCGF (U.S. Patent No. 

5 5,780,263). 

Within related aspects of the present invention, methods are provided for 
detecting a nucleic acid molecule which encodes a TGF-beta binding protein, 
comprising the steps of incubating an oligonucleotide as described above under 
conditions of high stringency, and detecting hybridization of said oligonucleotide. 
10 Within certain embodiments, the oligonucleotide may be labeled and/or bound to a 
solid support. 

Within other aspects of the present invention, ribozymes are provided 
which are capable of cleaving RNA which encodes one of the above-mentioned TGF- 
beta binding-proteins (e.g., Sequence ID NOs. 2, 6, 8, 10, 12, 14, or 16). Such 
15 ribozymes may be composed of DNA, RNA (including 2’-0-methyl ribonucleic acids), 
nucleic acid analogs ( e.g ., nucleic acids having phosphorothioate linkages) or mixtures 
thereof. Also provided are nucleic acid molecules (e.g., DNA or cDNA) w'hich encode 
these ribozymes, and vectors which are capable of expressing or producing the 
ribozymes. Representative examples of vectors include plasmids, retrotransposons, 
20 cosmids, and viral-based vectors (e.g., viral vectors generated at least in part from a 
retrovirus, adenovirus, or, adeno-associated virus). Also provided are host cells (e.g., 
human, dog, rat, or mouse cells) which contain these vectors. In certain embodiments, 
the host cell may be stably transformed with the vector. 

Within further aspects of the invention, methods are provided for 
25 producing ribozymes either synthetically, or by in vitro or in vivo transcription. Within 
further embodiments, the ribozymes so produced may be further purified and/or 
formulated into pharmaceutical compositions (eg., the ribozyme or nucleic acid 
molecule encoding the ribozyme along with a pharmaceutically acceptable carrier or 
diluent). Similarly, the antisense oligonucleotides and antibodies or other selected 
30 molecules described herein may be formulated into pharmaceutical compositions. 

Within other aspects of the present invention, antisense oligonucleotides 
are provided comprising a nucleic acid molecule which hybridizes to a nucleic acid 
molecule according to Sequence ID NOs. 1, 3, 5, 7, 9, 1 1, 13, or 15, or the complement 
thereto, and wherein said oligonucleotide inhibits the expression of TGF-beta binding 
35 protein as described herein (e.g., human BEER) Within various embodiments, the 
oligonucleotide is 15, 20, 25, 30, 35, 40, or 50 nucleotides in length. Preferably, the 
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oligonucleotide is less than 100, 75, or 60 nucleotides in length. As should be readily 
evident, the oligonucleotide may be comprised of one or more nucleic acid analogs, 
ribonucleic acids, or deoxyribonucleic acids. Further, the oligonucleotide may be 
modified by one or more linkages, including for example, covalent linkage such as a 
5 phosphorothioate linkage, a phosphotri ester linkage, a methyl phosphonate linkage, a 
methylene(methylimino) linkage, a morpholino linkage, an amide linkage, a polyamide 
linkage, a short chain alkyl intersugar linkage, a cycloalkyl intersugar linkage, a short 
chain heteroatomic intersugar linkage and a heterocyclic intersugar linkage. One 
representative example of a chimeric oligonucleotide is provied in U S. Patent No. 
10 5,989,912. 

Within yet another aspect of the present invention, methods are provided 
for increasing bone mineralization, comprising introducing into a warm-blooded animal 
an effective amount of the ribozyme as described above. Within related aspects, such 
methods comprise the step of introducing into a patient an effective amount of the 
15 nucleic acid molecule or vector as described herein which is capable of producing the 
desired ribozyme, under conditions favoring transcription of the nucleic acid molecule 
to produce the ribozyme. 

Within other aspects of the invention transgenic, non-human animals are 
provided. Within one embodiment a transgenic animal is provided whose germ cells 
20 and somatic cells contain a nucleic acid molecule encoding a TGF-beta binding-protein 
as described above which is operably linked to a promoter effective for the expression 
of the gene, the gene being introduced into the animal, or an ancestor of the animal, at 
an embryonic stage, with the proviso that said animal is not a human. Within other 
embodiments, transgenic knockout animals are provided, comprising an animal whose 
25 germ cells and somatic cells comprise a disruption of at least one allele of an 
endogenous nucleic acid molecule which hybridizes to a nucleic acid molecule which 
encodes a TGF-binding protein as described herein, wherein the disruption prevents 
transcription of messenger RNA from said allele as compared to an animal without the 
disruption, with the proviso that the animal is not a human. Within various 
30 embodiments, the disruption is a nucleic acid deletion, substitution, or, insertion. 
Within other embodiments the transgenic animal is a mouse, rat, sheep, pig, or dog. 

Within further aspects of the invention, kits are provided for the 
detection of TGF-beta binding-protein gene expression, comprising a container that 
comprises a nucleic acid molecule, wherein the nucleic acid molecule is selected from 
35 the group consisting of (a) a nucleic acid molecule comprising the nucleotide sequence 
of SEQ ID NOs. 1, 3, 5, 7, 9, 11, 13, or 15; (b) a nucleic acid molecule comprising the 
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complement of the nucleotide sequence of (a); (c) a nucleic acid molecule that is a 
fragment of (a) or (b) of at least 15, 20 30, 50, 75, or, 100 nucleotides in length. Also 
provided are kits for the detection of a TGF-beta binding-protein which comprise a 
container that comprise one of the TGF-beta binding protein antibodies described 
5 herein. 

For example, within one aspect of the present invention methods are 
provided for determining whether a selected molecule is capable of increasing bone 
mineral content, comprising the steps of (a) mixing one or more candidate molecules 
with TGF-beta-binding-protein encoded by the nucleic acid molecule according to 
10 claim 1 and a selected member of the TGF-beta family of proteins (e.g., BMP 5 or 6), 
(b) determining whether the candidate molecule alters the signaling of the TGF-beta 
family member, or alters the binding of the TGF-beta binding-protein to the TGF-beta 
family member. Within certain embodiments, the molecule alters the ability of TGF- 
beta to function as a positive regulator of mesenchymal cell differentiation. Within 
15 this aspect of the present invention, the candidate molecule(s) may alter signaling or 
binding by, for example, either decreasing ( e.g ., inhibiting), or increasing (e.g . , 
enhancing) signaling or binding. 

Within yet another aspect, methods are provided for determining 
whether a selected molecule is capable of increasing bone mineral content, comprising 
20 the step of determining whether a selected molecule inhibits the binding of TGF-beta 
binding-protein to bone, or an analogue thereof. Representative examples of bone or 
analogues thereof include hydroxyapatite and primary human bone samples obtained 
via biopsy. 

Within certain embodiments of the above-recited methods, the selected 
25 molecule is contained within a mixture of molecules and the methods may further 
comprise the step of isolating one or more molecules which are functional within the 
assay. Within yet other embodiments, TGF-beta family of proteins is bound to a solid 
support and the binding of TGF-beta binding-protein is measured or TGF-beta binding- 
protein are bound to a solid support and the binding of TGF-beta proteins are measured. 
30 Utilizing methods such as those described above, a wide variety of 

molecules may be assayed for their ability to increase bone mineral content by 
inhibiting the binding of the TGF-beta binding-protein to the TGF-beta family of 
proteins. Representative examples of such molecules include proteins or peptides, 
organic molecules, and nucleic acid molecules. 

35 Within other related aspects of the invention, methods are provided for 

increasing bone mineral content in a warm-blooded animal, comprising the step of 
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administering to a warm-blooded animal a therapeutically effective amount of a 
molecule identified from the assays recited herein. Within another aspect, methods are 
provided for increasing bone mineral content in a warm-blooded animal, comprising 
the step of administering to a warm-blooded animal a therapeutically effective amount 
5 of a molecule which inhibits the binding of the TGF-beta binding-protein to the TGF- 
beta super-family of proteins, including bone morphogenic proteins (BMPs). 
Representative examples of suitable molecules include antisense molecules, ribozymes, 
ribozyme genes, and antibodies (eg., a humanized antibody) which specifically 
recognize and alter the activity of the TGF-beta binding-protein. 

10 Within another aspect of the present invention, methods are provided for 

increasing bone mineral content in a warm-blooded animal, comprising the steps of 
(a) introducing into cells which home to the bone a vector which directs the expression 
of a molecule which inhibits the binding of the TGF-beta binding-protein to the TGF- 
beta family of proteins and bone morphogenic proteins (BMPs), and (b) administering 
15 the vector-containing cells to a warm-blooded animal. As utilized herein, it should be 
understood that cells “home to bone” if they localize within the bone matrix after 
peripheral administration. Within one embodiment, such methods further comprise, 
prior to the step of introducing, isolating cells from the marrow of bone which home to 
the bone. Within a further embodiment, the cells which home to bone are selected from 
20 the group consisting of CD34+ cells and osteoblasts. 

Within other aspects of the present invention, molecules are provided 
(preferably isolated) which inhibit the binding of the TGF-beta binding-protein to the 
TGF-beta super-family of proteins. 

Within further embodiments, the molecules may be provided as a 
25 composition, and can further comprise an inhibitor of bone resorption. Representative 
examples of such inhibitors include calcitonin, estrogen, a bisphosphonate, a growth 
factor having anti-resorptive activity and tamoxifen. 

Representative examples of molecules which may be utilized in the 
afore-mentioned therapeutic contexts include, e.g., ribozymes, ribozyme genes, 
30 antisense molecules, and/or antibodies (e.g., humanized antibodies). Such molecules 
may depending upon their selection, used to alter, antagonize, or agonize the signalling 
or binding of a TGF-beta binding-protein family member as described herein 

Within various embodiments of the invention, the above-described 
molecules and methods of treatment or prevention may be utilized on conditions such 
35 as osteoporosis,' osteomalasia, periodontal disease, scurvy, Cushing’s Disease, bone 
fracture and conditions due to limb immobilization and steroid usage. 
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These and other aspects of the present invention will become evident 
upon reference to the following detailed description and attached drawings. In 
addition, various references are set forth herein which describe in more detail certain 
procedures or compositions ( e.g ., plasmids, etc.), and are therefore incorporated by 
5 reference in their entirety. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic, illustration comparing the amino acid sequence 
of Human Dan, Human Gremlin; Human Cerberus and Human Beer. Arrows indicate 
the Cysteine backbone. 

10 Figure 2 summarizes the results obtained from surveying a variety of 

human tissues for the expression of a TGF-beta binding-protein gene, specifically, the 
Human Beer gene. A semi-quantitative Reverse Transcription-Polymerase Chain 
Reaction (RT-PCR) procedure was used to amplify a portion of the gene from first- 
strand cDNA synthesized from total RNA (described in more detail in EXAMPLE 2A). 
15 Figure 3 summarizes the results obtained from RNA in situ 

hybridization of mouse embryo sections, using a cRNA probe that is complementary to 
the mouse Beer transcript (described in more detail in EXAMPLE 2B). Panel A is a 
transverse section of 10.5 dpc embryo. Panel B is a sagittal section of 12.5 dpc embryo 
and panels C and D are sagittal sections of 15.5 dpc embryos. 

20 Figure 4 illustrates, by western blot analysis, the specificity of three 

different polyclonal antibodies for their respective antigens (described in more detail in 
EXAMPLE 4). Figure 4A shows specific reactivity of an anti-H. Beer antibody for H. 
Beer antigen, but not H. Dan or H. Gremlin. Figure 4B shows reactivity of an anti-H. 
Gremlin antibody for H. Gremlin antigen, but not H. Beer or H. Dan. Figure 4C shows 
25 reactivity of an anti-H. Dan antibody for H. Dan, but not H. Beer or H. Gremlin. 

Figure 5 illustrates, by western blot analysis, the selectivity of the TGF- 
beta binding-protein, Beer, for BMP-5 and BMP-6, but not BMP-4 (described in more 
detail in EXAMPLE 5). 

Figure 6 demonstrates that the ionic interaction between the TGF-beta 
30 binding-protein. Beer, and BMP-5 has a dissociation constant in the 1 5-30 nM range. 
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DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

Prior to setting forth the invention in detail, it may be helpful to an 
understanding thereof to set forth definitions of certain terms and to list and to define 
5 the abbreviations that will be used hereinafter. 

“ Molecule ” should be understood to include proteins or peptides 
(eg., antibodies, recombinant binding partners, peptides with a desired binding 
affinity), nucleic acids (c.g., DNA, RNA, chimeric nucleic acid molecules, and nucleic 
acid analogues such as PNA); and organic or inorganic compounds. 

10 “ TGF-beta ” should be understood to include any known or novel 

member of the TGF-beta super-family, which also includes bone morphogenic proteins 
(BMPs). 

“ TGF-beta receptor ” should be understood to refer to the receptor 
specific for a particular member of the TGF-beta super-family (including bone 
15 morphogenic proteins (BMPs)). 

“ TGF-beta binding-protein ” should be understood to refer to a protein 
with specific binding affinity for a particular member or subset of members of the 
TGF-beta super-family (including bone morphogenic proteins (BMPs)). Specific 
examples of TGF-beta binding-proteins include proteins encoded by Sequence ID Nos. 
20 1, 5, 7, 9, 11, 13, and 15. 

Inhibiting the “ binding of the TGF-beta binding-protein to the TGF-beta 
family of proteins and bone morphogenic proteins CBMPs Y’ should be understood to 
refer to molecules which allow the activation of TGF-beta or bone morphogenic 
proteins (BMPs), or -allow the binding of TGF-beta family members including bone 
25 morphogenic proteins (BMPs) to their respective receptors, by removing or preventing 
TGF-beta from binding to TGF-binding-protein. Such inhibition may be accomplished, 
for example, by molecules which inhibit the binding of the TGF-beta binding-protein to 
specific members of theTGF-beta super-family. 

“ Vector” refers to an assembly which is capable of directing the 
30 expression of desired protein. The vector must include transcriptional promoter 
elements which are operably linked to the gene(s) of interest. The vector may be 
composed of either deoxyribonucleic acids (“DNA”), ribonucleic acids (“RNA”), or a 
combination of the two (e.g., a DNA-RNA chimeric). Optionally, the .vector may 
include a polyadenylation sequence, one or more restriction sites, as well as one or 
35 more selectable markers such as neomycin phosphotransferase or hygromycin 
phosphotransferase. Additionally, depending on the host cell chosen and the vector 
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employed, other genetic elements such as an origin of replication, additional nucleic 
acid restriction sites, enhancers, sequences conferring inducibility of transcription, and 
selectable markers, may also be incorporated into the vectors described herein. 

An “ isolated nucleic acid molecule " is a nucleic acid molecule that is not 
5 integrated in the genomic DNA of an organism. For example, a DNA molecule that 
encodes a TGF-binding protein that has been separated from the genomic DNA of a 
eukaryotic cell is an isolated DNA molecule. Another example of an isolated nucleic acid 
molecule is a chemically-synthesized nucleic acid molecule that is not integrated in the 
genome of an organism. The isolated nucleic acid molecule may be genomic DNA, cDNA, 
10 RNA, or composed at least in part of nucleic acid analogs. 

An ‘ isolated polypeptide ” is a polypeptide that is essentially free from 
contaminating cellular components, such as carbohydrate, lipid, or other proteinaceous 
impurities associated with the polypeptide in nature. Within certain embodiments, a 
particular protein preparation contains an isolated polypeptide if it appears nominally as 
15 a single band on SDS-PAGE gel with Coomassie Blue staining. “ Isolated ” when 
referring to organic molecules means that the compounds are greater than 90 percent 
pure utilizing methods which are well known in the art (c.g., NMR, melting point). 

“ Scierosteosis ” Sclerosteosis is a term that was applied by Hansen 
(1967) (Hansen, H. G., Sklerosteose.In: Opitz, H.; Schmid, F., Handbuch der 
20 Kinderheilkunde. Berlin: Springer (pub.) 6 1967. Pp. 351-355) to a disorder similar to 
van Buchem hyperostosis corticalis generalisata but possibly differing in radiologic 
appearance of the bone changes and in the presence of asymmetric cutaneous 
syndactyly of the index and middle fingers in many cases. The jaw has an unusually 
square appearance in this condition. 

25 “ Humanized antibodies ” are recombinant proteins in which murine 

complementary determining regions of monoclonal antibodies have been transferred 
from heavy and light variable chains of the murine immunoglobulin into a human 
variable domain. 

As used herein, an “ antibody fragment ” is a portion of an antibody such 
30 as F(ab’) 2 , F(ab) 2 , Fab’, Fab, and the like. Regardless of structure, an antibody 
fragment binds with the same antigen that is recognized by the intact antibody. For 
example, an anti-TGF-beta binding-protein monoclonal antibody fragment binds with 
an epitope of TGF-beta binding-protein. 

The term “ antibody fragment ” also includes any synthetic or genetically 
35 engineered protein that acts like an antibody by binding to a specific antigen to form a 
complex. For example, antibody fragments include isolated fragments consisting of the 




WO 00/32773 



PCTYUS99/27990 



13 



light chain variable region, ‘Tv” fragments consisting of the variable regions of the 
heavy and light chains, recombinant single chain polypeptide molecules in which light 
and heavy variable regions are connected by a peptide linker (“sFv proteins”), and 
minimal recognition units consisting of the amino acid residues that mimic the 
5 hypervariable region. 

A “ detectable label ” is a molecule or atom which can be conjugated to 
an antibody moiety to produce a molecule useful for diagnosis. Examples of detectable 
labels include chelators, photoactive agents, radioisotopes, fluorescent agents, 
paramagnetic ions, enzymes, and other marker moieties. 

10 As used herein, an “ immunoconiuuate ” is a molecule comprising an 

anti-TGF-beta binding-protein antibody, or an antibody fragment, and a detectable 
label. An immunoconjugate has roughly the same, or only slightly reduced, ability to 
bind TGF-beta binding-protein after conjugation as before conjugation. 

Abbreviations : TGF-beta - “Transforming Growth Factor-beta”; TGF- 
15 bBP - “Transforming Growth Factor-beta binding-protein” (one representative TGF- 
bBP is designated “H. Beer”); BMP - “bone morphogenic protein”; PCR - 
“polymerase chain reaction”, RT-PCR - PCR process in which RNA is first transcribed 
into DNA at the first step using reverse transcriptase (RT); cDNA - any DNA made by 
copying an RNA sequence into DNA form. 

20 

As noted above, the present invention provides a novel class of TGF- 
beta binding-proteins, as well as methods and compositions for increasing bone mineral 
content in warm-blooded animals. Briefly, the present inventions are based upon the 
unexpected discovery that a mutation in the gene which encodes a novel member of the 
25 TGF-beta binding-protein family results in a rare condition (sclerosteosis) characterized 
by bone mineral contents which are one- to four-fold higher than in normal individuals. 
Thus, as discussed in more detail below this discovery has led to the development of 
assays which may be utilized to select molecules which inhibit the binding of the TGF- 
beta binding-protein to the TGF-beta family of proteins and bone morphogenic proteins 
30 (BMPs), and methods of utilizing such molecules for increasing the bone mineral 
content of warm-blooded animals (including for example, humans). 

Discussion of the Disease Known as Sclerosteosis 

Sclerosteosis is a term that was applied by Hansen (1967) (Hansen, H 
G., Sklerosteose.ln: Opitz, H.; Schmid, F., Handbuch der Kinderheilkunde. Berlin: 
35 Springer (pub.) 6 1967. Pp. 351-355) to a disorder similar to van Buchem hyperostosis 
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corticalis generalisata but possibly differing in radiologic appearance of the bone 
changes and in the presence of asymmetric cutaneous syndactyly of the index and 
middle fingers in many cases. 

Sclerosteosis is now known to be an autosomal semi-dominant disorder 
5 which is characterized by widely disseminated sclerotic lesions of the bone in the adult. 
The condition is progressive. Sclerosteosis also has a developmental aspect which is 
associated with syndactyly (two or more fingers are fused together). The Sclerosteosis 
Syndrome is associated with large stature and many affected individuals attain a height 
of six feet or more. The bone mineral content of homozygotes can be I to 6 fold over 
10 normal individuals and bone mineral density can be 1 to 4 fold above normal values 
(e.g., from unaffected siblings). 

I he Sclerosteosis Syndrome occurs primarily in Afrikaaners of Dutch 
descent in South Africa. Approximately 1/140 individuals in the Afrikaaner population 
are carriers of the mutated gene (heterozygotes). The mutation shows 100% 
15 penetrance. There are anecdotal reports of increased of bone mineral density in 
heterozygotes with no associated pathologies (syndactyly or skull overgrowth). 

It appears at the present time that there is no abnormality of the 
pituitary-hypothalamus axis in Sclerosteosis. In particular, there appears to be no over- 
production of growth hormone and cortisone. In addition, sex hormone levels are 
20 normal in affected individuals. However, bone turnover markers (osteoblast specific 
alkaline phosphatase, osteocalcin, type 1 procollagen C’ propeptide (PICP), and total 
alkaline phosphatase; (see Cornier, C., Cun. Opin. in Rheti. 7:243, 1995) indicate that 
there is hyperosteoblastic activity associated with the disease but that there is normal to 
slightly decreased osteoclast activity' as, measured by markers of bone resorption 
25 (pyridinoline, deoxypryridinoline, N-telopeptide, urinary hydroxyproline, plasma 
tartrate-resistant acid phosphatases and galactosyl hydroxylysine (see Cornier, supra)). 

Sclerosteosis is characterized by the continual deposition of bone 
throughout the skeleton during the lifetime of the affected individuals. In homozygotes 
the continual deposition of bone mineral leads to an overgrowth of bone in areas of the 
30 skeleton where there is an absence of mechanoreceptors (skull, jaw, cranium). In 
homozygotes with Sclerosteosis, the overgrowth of the bones of the skull leads to 
cranial compression and eventually to death due to excessive hydrostatic pressure on 
the brain stem. In all other parts of the skeleton there is a generalized and diffuse 
sclerosis. Cortical areas of the long bones are greatly thickened resulting in a 
35 substantial increase in bone strength. Trabecular connections are increased in thickness 
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which in turn increases the strength of the trabecular bone. Sclerotic bones appear 
unusually opaque to x-rays 

As described in more detail in Example I, the rare genetic mutation that 
is responsible for the Sclerosteosis syndrome has been localized to the region of human 
5 chromosome 17 that encodes a novel member of the TGF-beta binding-protein family 
(one representative example of which is designated “H. Beer"). As described in more 
detail below, based upon this discovery, the mechanism of bone mineralization is more 
fully understood, allowing the development of assays for molecules which increase 
bone mineralization, and use of such molecules to increase bone mineral content, and in 
10 the treatment or prevention of a wide number of diseases. 

TGF-beta Super-family 

The Transforming Growth Factor-beta (TGF-beta) super-family contains 
a variety of growth factors that share common sequence elements and structural motifs 
(at both the secondary and tertiary levels). This protein family is known to exert a wide 
15 spectrum of biological responses on a large variety of cell types. Many of them have 
important functions during the embryonal development in pattern formation and tissue 
specification; in adults they are involved, e.g ., in wound healing and bone repair and 
bone remodeling, and in the modulation of the immune system. In addition to the three 
TGF-beta’s, the super-family includes the Bone Morphogenic Proteins (BMPs), 
20 Activins, Inhibins, Growth and Differentiation Factors (GDFs), and Glial-Derived 
Neurotrophic Factors (GDNFs). Primary classification is established through general 
sequence features that bin a specific protein into a general sub-family. Additional 
stratification within the sub-family is possible due to stricter sequence conservation 
between members of the smaller group. In certain instances, such as with BMP-5, 
25 BMP-6 and BMP-7, this can be as high as 75 percent amino acid homology between 
members of the smaller group. This level of identity enables a single representative 
sequence to illustrate the key . biochemical elements of the sub-group that separates it 
from other members of the larger family. 

TGF-beta signals by inducing the formation of hetero-oligomeric 
30 complexes of type 1 and type II receptors. The crystal structure of TGF-beta2 has been 
determined. The general fold of the TGF-beta2 monomer contains a stable, compact, 
cysteine knotlike structure formed by three disulphide bridges. Dimerization, stabilized 
by one disulphide bridge, is antiparallel. 

TGF-beta family members initiate their cellular action by binding to 
35 receptors with intrinsic serine/threonine kinase activity This receptor family consists 
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of two subfamilies, denoted type 1 and type II receptors. Each member of the TGF-beta 
family binds to a characteristic combination of type I and type II receptors, both of 
which are needed for signaling. In the current model for TGF-beta receptor activation, 
TGF-beta first binds to the type II receptor (TbR-Il), which occurs in the cell 
5 membrane in an oligomeric form with activated kinase. Thereafter, the type 1 receptor 
(TbR-I), which can not bind ligand in the absence of TbR-II, is recruited into the 
complex. TbR-II then phosphorylates TbR-I predominantly in a domain rich in glycine 
and serine residues (GS domain) in the juxtamembrane region, and thereby activates 
TbR-I. 

10 Thus far seven type I receptors and five type II receptors have been 

identified. 

Bone Morphogenic Proteins (BMPs) ark Key Regulatory Proteins in 
Determining Bone Mineral Density in Humans 
A major advance in the understanding of bone formation was the 
15 identification of the bone morphogenic proteins (BMPs), also known as osteogenic 
proteins (OPs), which regulate cartilage and bone differentiation in vivo. BMPs/OPs 
induce endochondral bone differentiation through a cascade of events which include 
formation of cartilage, hypertrophy and calcification of the cartilage, vascular invasion, 
differentiation of osteoblasts, and formation of bone. As described above, the 
20 BMPs/OPs (BMP 2-14, and osteogenic protein 1 and -2, OP-1 and OP-2) are members 
of the TGF-beta super-family. The striking evolutionary conservation between 
members the BMP/OP sub-family suggests that they are critical in the normal 
development and function of animals. Moreover, the presence of multiple forms of 
BMPs/OPs raises an important question about the biological relevance of this apparent 
25 redundancy. In addition to postfetal chondrogenesis and osteogenesis, the BMPs/OPs 
play multiple roles in skeletogenesis (including the development of craniofacial and 
dental tissues) and in embryonic development and organogenesis of parenchymatous 
organs, including the kidney. It is now understood that nature relies on common (and 
few) molecular mechanisms tailored to provide the emergence of specialized tissues 
30 and organs. The BMP/OP super-family is an elegant example of nature parsimony in 
programming multiple specialized functions deploying molecular isoforms with minor 
variation in amino acid motifs within highly conserved carboxy-terminal regions. 

BMP Antagonism 



The BMP and Activin sub-families are subject to significant post- 
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translational regulation. An intricate extracellular control system exists, whereby a 
high affinity antagonist is synthesized and exported, and subsequently complexes 
selectively with BMPs or activins to disrupt their biological activity (W.C. Smith 
(1999) TIG 75(1) 3-6). A number of these natural antagonists have been identified, and 
5 based on sequence divergence appear to have evolved independently due to the lack of 
primary sequence conservation. There has been no structural work to date on this class 
of proteins. Studies of these antagonists has highlighted a distinct preference for 
interacting and neutralizing BMP-2 and BMP-4. Furthermore, the mechanism of 
inhibition seems to differ for the different antagonists (S. lemura et al. (1998) Pruc 
10 Nat! Acad Sci USA 95 9337-9342) 

Novf.i, TGF-beta Binding-proteins 



1. Background re: TGF-beta binding-proteins 

As noted above, the present invention provides a novel class of TGF- 
beta binding-proteins that possess a nearly identical cysteine (disulfide) scaffold when 
15 compared to Human DAN, Human Gremlin, and Human Cerberus, and SCGF (U S. 
Patent No. 5,780,263) but almost no homology at the nucleotide level (for background 
information, see generally Hsu, DR., Economides, A N., Wang, X., Eimon, P.M., 
Harland, R.M., “The Xenopus Dorsalizing Factor Gremlin Identifies a Novel Family of 
Secreted Proteins that Antagonize BMP Activities,” Molecular Cell 7:673-683, 1998). 

20 One representative example of the novel class of TGF-beta binding- 

proteins is disclosed in Sequence ID Nos. 1, 5, 9, 11, 13, and 15. Representative 
members of this class of binding proteins should also be understood to include variants 
of the TGF-beta binding-protein (e.g.. Sequence ID Nos. 5 and 7). As utilized herein, a 
“TGF-beta binding-protein variant gene” refers to nucleic acid molecules that encode a 
25 polypeptide having an amino acid sequence that is a modification of SEQ ID Nos: 2, 
10, 12, 14 or 16. Such variants include naturally-occurring polymorphisms or allelic 
variants of TGF-beta binding-protein genes, as well as synthetic genes that contain 
conservative amino acid substitutions of these amino acid sequences. Additional 
variant forms of a TGF-beta binding-protein gene are nucleic acid molecules that 
30 contain insertions or deletions of the nucleotide sequences described herein. TGF-beta 
binding-protein variant genes can be identified by determining whether the genes 
hybridize with a nucleic acid molecule having the nucleotide sequence of SEQ ID Nos: 
1, 5, 7, 9, 11, 13, or 15 under stringent conditions. In addition, TGF-beta binding- 
protein variant genes should encode a protein having a cysteine backbone. 




WO 00/32773 



PCT/US99/27990 



18 



As an alternative, TGF-beta binding-protein variant genes can be 
identified by sequence comparison. As used herein, two amino acid sequences have 
“100% amino acid sequence identity’" if the amino acid residues of the two amino acid 
sequences are the same when aligned for maximal correspondence. Similarly, two 
5 nucleotide sequences have “100% nucleotide sequence identity” if the nucleotide 
residues of the two nucleotide sequences are the same when aligned for maximal 
correspondence. Sequence comparisons can be performed using standard software 
programs such as those included in the LASERGENE bioinformatics computing suite, 
which is produced by DNASTAR (Madison, Wisconsin). Other methods for 
10 comparing two nucleotide or amino acid sequences by determining optimal alignment 
are well-known to those of skill in the art (see, for example, Peruski and Peruski, The 
Internet and the New Biology: Tools for Genomic and Molecular Research (ASM 
Press, Inc. 1997), Wu et al. (eds.), “Information Superhighway and Computer 
Databases of Nucleic Acids and Proteins,” in Methods in Gene Biotechnology, pages 
15 123-151 (CRC Press, Inc. 1997), and Bishop (ed ). Guide to Human Genome 

Computing, 2nd Edition (Academic Press, Inc. 1998)). 

A variant TGF-beta binding-protein should have at least a 50% amino 
acid sequence identity to SEQ ID NOs: 2, 6, 10, 12, 14 or 16 and preferably, greater 
than 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95% identity. Alternatively, TGF-beta 
20 binding-protein variants can be identified by having at least a 70% nucleotide sequence 
identity to SEQ ID NOs: 1, 5, 9, 11, 13 or 15. Moreover, the present invention 
contemplates TGF-beta binding-protein gene variants having .greater than 75%, 80%, 
85%, 90%, or 95% identity to SEQ ID NO: 1 . Regardless of the particular method used 
to identify a TGF-beta binding-protein variant gene or variant TGF-beta binding- 
25 protein, a variant TGF-beta binding-protein or a polypeptide encoded by a variant 
TGF-beta binding-protein gene can be functionally characterized by, for example, its 
ability to bind to and/or inhibit the signaling of a selected member of the TGF-beta 
family of proteins, or by its ability to bind specifically to an anti-TGF-beta binding- 
protein antibody. 

30 The present invention includes functional fragments of TGF-beta 

binding-protein genes. Within the context of this invention, a “functional fragment” of 
a TGF-beta binding-protein gene refers to a nucleic acid molecule that encodes a 
portion of a TGF-beta binding-protein polypeptide which either (1) possesses the 
above-noted function activity, or (2) specifically binds with an anti-TGF-beta binding- 
35 protein antibody. For example, a functional fragment of a TGF-beta binding-protein 
gene described herein comprises a portion of the nucleotide sequence of SEQ ID Nos: 
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1, 5, 9, 11, 13, or 15. 

2. Isolation of the TGF-beta binding-protein gene 

DNA molecules encoding a binding-protein gene can be obtained by 
screening a human cDNA or genomic library using polynucleotide probes based upon, 
5 for example, SEQ ID NO: 1 . 

For example, the first step in the preparation of a cDNA library is to isolate 
RNA using methods well-known to those of skill in the art. In general, RNA isolation 
techniques must provide a method for breaking cells, a means of inhibiting RNase-directed 
degradation of RNA, and a method of separating RNA from DNA, protein, and 
10 polysaccharide contaminants. For example, total RNA can be isolated by freezing tissue in 
liquid nitrogen, grinding the frozen tissue with a mortar and pestle to lyse the cells, 
extracting the ground tissue with a solution of phenol/chloroform to remove proteins, and 
separating RNA from the remaining impurities by selective precipitation with lithium 
chloride (see, for example, Ausubel et al. (eds ). Short Protocols in Molecular Biology, 3rd 
15 Edition, pages 4-1 to 4-6 (John Wiley & Sons 1995) [“Ausubel (1995)”]; Wu et al., 
Methods in Gene Biotechnology, pages 33-41 (CRC Press, Inc. 1997) [“Wu (1997)”]). 

Alternatively, total RNA can be isolated by extracting ground tissue with 
guanidinium isothiocyanate, extracting with organic solvents, and separating RNA from 
contaminants using differential centrifugation (see, for example, Ausubel (1995) at pages 4- 
20 1 to 4-6; Wu (1997) at pages 33-41). 

In order to construct a cDNA library, poly(A) + RNA must be isolated from a 
total RNA preparation. Polv(A)' RNA can be isolated from total RNA by using the 
standard technique of oligo(dT)-cellulose chromatography (see, for example, Ausubel 
(1995) at pages 4-1 1 to 4-12). 

25 Double-stranded cDNA molecules are synthesized from poly(A) + RNA 

using techniques well-known to those in the art. (see, for example, Wu (1997) at pages 
41-46). Moreover, commercially available kits can be used to synthesize double- 
stranded cDNA molecules. For example, such kits are available from Life 
Technologies, Inc. (Gaithersburg, Maryland), CLONTECH Laboratories, Inc. (Palo 
30 Alto, California), Promega Corporation (Madison, Wisconsin) and Stratagene Cloning 
Systems (La Jolla, California). 

The basic approach for obtaining TGF-beta binding-protein cDNA clones 
can be modified by constructing a subtracted cDNA library which is enriched in TGF- 
binding-protein-specific cDNA molecules. Techniques for constructing subtracted libraries 
35 are well-known to those of skill in the art (see, for example, Sargent, “Isolation of 
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Differentially Expressed Genes,” in Meth. Enzymol 152:423, 1987, and Wu et al. (eds), 
“Construction and Screening of Subtracted and Complete Expression cDNA Libraries,” in 
Methods in Gene Biotechnology, pages 29-65 (CRC Press, Inc. 1997)). 

Various cloning vectors are appropriate for the construction of a cDNA 
5 library. For example, a cDNA library can be prepared in a vector derived from 
bacteriophage, such as a AgtlO vector (see, for example, Huynh et al., “Constructing 
and Screening cDNA Libraries in XgtlO and Agt 1 1,” in DMA Cloning: A Practical 
Approach Vol. /, Glover (ed ), page 49 (IRL Press, 1985); Wu (1997) at pages 47-52). 

Alternatively, double-stranded cDNA molecules can be inserted into a 
10 plasmid vector, such as a pBluescript vector (Stratagene Cloning Systems; La Jolla, 
California), a LambdaGEM-4 (Promega Corp.; Madison, Wisconsin) or other commercially 
available vectors. Suitable cloning vectors also can be obtained from the American Type 
Culture Collection (Rockville, Maryland). 

In order to amplify the cloned cDNA molecules, the cDNA library is 
15 inserted into a prokaryotic host, using standard techniques. For example, a cDNA library 
can be introduced into competent E coli DH5 cells, which can be obtained from Life 
Technologies, Inc. (Gaithersburg, Maryland). 

A human genomic DNA library can be prepared by means well-known in 
the art (see, for example, Ausubel (1995) at pages 5-1 to 5-6; Wu (1997) at pages 307-327). 
20 Genomic DNA can be isolated by lysing tissue with the detergent Sarkosyl, digesting the 
lysate with proteinase K, clearing insoluble debris from the lysate by centrifugation, 
precipitating nucleic acid from the lysate using isopropanol, and purifying resuspended 
DNA on a cesium chloride density gradient. 

DNA fragments that are suitable for the production of a genomic library can 
25 be obtained by the random shearing of genomic DNA or by the partial digestion of genomic 
DNA with restriction endonucleases. Genomic DNA fragments can be inserted into a 
vector, such as a bacteriophage or cosmid vector, in accordance with conventional 
techniques, such as the use of restriction enzyme digestion to provide appropriate termini, 
the use of alkaline phosphatase treatment to avoid undesirable joining of DNA molecules, 
.30 and ligation with appropriate ligases. Techniques for such manipulation are well-known in 
the art (sec, for example, Ausubel (1995) at pages 5-1 to 5-6; Wu (1997) at pages 307-327). 

Nucleic acid molecules that encode a TGF-beta binding-protein gene can 
also be obtained using the polymerase chain reaction (PCR) with oligonucleotide 
primers having nucleotide sequences that are based upon the nucleotide sequences of 
35 the human TGF-beta binding-protein gene, as described herein. General methods for 
screening libraries with PCR are provided by, for example, Yu et al., “Use of the 
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Polymerase Chain Reaction to Screen Phage Libraries,” in Methods in Molecular 
Biology, Vol. 15: PCR Protocols: Current Methods and Applications, White (ed.), 
pages 211-215 (Humana Press, Inc. 1993). Moreover, techniques for using PCR to 
isolate related genes are described by, for example, Preston, “Use of Degenerate 
5 Oligonucleotide Primers and the Polymerase Chain Reaction to Clone Gene Family 
Members,” in Methods in Molecular Biology, Vol. 15: PCR Protocols : Current 
Methods and. Applications, White (ed ), pages 317-337 (Humana Press, Inc. 1993). 

Alternatively, human genomic libraries can be obtained from commercial 
sources such as Research Genetics (Huntsville, AL) and the American Type Culture 
10 Collection (Rockville, Maryland). 

A library containing cDNA or genomic clones can be screened with one or 
more polynucleotide probes based upon SEQ ID NO: I, using standard methods (see, for 
example, Ausubel (1995) at pages 6-1 to 6-11). 

Anti-TGF-beta binding-protein antibodies, produced as described below, 
15 can also be used to isolate DNA sequences that encode TGF-beta binding-protein genes 
from cDNA libraries. For example, the antibodies can be used to screen Agtl 1 
expression libraries, or the antibodies can be used for immunoscreening following 
hybrid selection and translation (see, for example, Ausubel (1995) at pages 6-12 to 6- 
16; Margolis et al , “Screening X expression libraries with antibody and protein 
20 probes,” in DMA Cloning 2: Expression Systems, 2nd Edition, Glover et al. (eds.), 
pages 1-14 (Oxford University Press 1995)). 

The sequence of a TGF-beta binding-protein cDNA or TGF-beta 
binding-protein genomic fragment can be determined using standard methods. 
Moreover, the identification of genomic fragments containing a TGF-beta binding- 
25 protein promoter or regulatory element can be achieved using well-established 
techniques, such as deletion analysis (see, generally, Ausubel (1995)). 

As an alternative, a TGF-beta binding-protein gene can be obtained by 
synthesizing DNA molecules using mutually priming long oligonucleotides and the 
nucleotide sequences described herein (see, for example, Ausubel (1995) at pages 8-8 
30 to 8-9). Established techniques using the polymerase chain reaction provide the ability 
to synthesize DNA molecules at least two kilobases in length (Adang et al., Plant 
Molec. Biol. 21: 1131, 1993; Bambot et al., PCR Methods and Applications 2: 266, 
1993; Dillon et al., “Use of the Polymerase Chain Reaction for the Rapid Construction 
of Synthetic Genes,” in Methods in Molecular Biology, Vol. 15: PCR Protocols: 
35 Current Methods and Applications, White (ed ), pages 263-268, (Humana Press, Inc. 
1993); Holowachuk et al., PCR Methods Appl. 4: 299, 1995). 
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3. Production of TGF-beta binding-protein genes 

Nucleic acid molecules encoding variant TGF-beta binding-protein 
genes can be obtained by screening various cDNA or genomic libraries with 
polynucleotide probes having nucleotide sequences based upon SEQ ID NO:l, 5, 9, 11, 
5 13, or 15, using procedures described above. TGF-beta binding-protein gene variants 

can also be constructed synthetically. For example, a nucleic acid molecule can be 
devised that encodes a polypeptide having a conservative amino acid change, compared 
with the amino acid sequence of SEQ ID NOs: 2, 6, 8, 10, 12, 14, or 16. That is, 
variants can be obtained that contain one or more amino acid substitutions of SEQ ID 
It) NOs: 2, 6, 8, 10, 12, 14 or 16, in which an alkyl amino acid is substituted for an alkyl 
amino acid in a TGF-beta binding-protein amino acid sequence, an aromatic amino acid 
is substituted for an aromatic, amino acid in a TGF-beta binding-protein amino acid 
sequence, a sulfur-containing amino acid is substituted for a sulfur-containing amino 
acid in a TGF-beta binding-protein amino acid sequence, a hydroxy-containing amino 
15 acid is substituted for a hydroxy-containing amino acid in a TGF-beta binding-protein 
amino acid sequence, an acidic amino acid is substituted for an acidic amino acid in a 
TGF-beta binding-protein amino acid sequence, a basic amino acid is substituted for a 
basic amino acid in a TGF-beta binding-protein amino acid sequence, or a dibasic 
monocarboxylic amino acid is substituted for a dibasic monocarboxylic amino acid in a 
20 TGF-beta binding-protein amino acid sequence. 

Among the common amino acids, for example, a “conservative amino 
acid substitution” is illustrated by a substitution among amino acids within each of the 
following groups: (1) glycine, alanine, valine, leucine, and isoleucine, (2) 
phenylalanine, tyrosine, and tryptophan, (3) serine and threonine, (4) aspartate and 
25 glutamate, (5) glutamine and asparagine, and (6) lysine, arginine and histidine. In 
making such substitutions, it is important to, where possible, maintain the cysteine 
backbone outlined in Figure 1 . 

Conservative amino acid changes in a TGF-beta binding-protein gene 
can be introduced by substituting nucleotides for the nucleotides recited in SEQ ID 
30 NO:l. Such “conservative amino acid” variants can be obtained, for example, by 
oligonucleotide-directed mutagenesis, linker-scanning mutagenesis, mutagenesis using 
the polymerase chain reaction, and the like (see Ausubel (1995) at pages 8-10 to 8-22; 
and McPherson (ed ), Directed Mutagenesis: A Practical Approach (IRL Press 1991)). 
The functional ability of such variants can be determined using a standard method, such 
35 as the assay described herein. Alternatively, a variant TGF-beta binding-protein 
polypeptide can be identified by the ability to specifically bind anti-TGF-beta binding- 
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protein antibodies. 

Routine deletion analyses of nucleic acid molecules can be performed to 
obtain “functional fragments” of a nucleic acid molecule that encodes a TGF-beta 
binding-protein polypeptide. As an illustration, DNA molecules having the nucleotide 
5 sequence of SEQ ID NO: 1 can be digested with Ball 1 nuclease to obtain a series of 
nested deletions. The fragments are then inserted into expression vectors in proper 
reading frame, and the expressed polypeptides are isolated and tested for activity, or for 
the ability to bind anti-TGF-beta binding-protein antibodies. One alternative to 
exonuclease digestion is to use oligonucleotide-directed mutagenesis to introduce 
l() deletions or stop codons to specify production of a desired fragment. Alternatively, 
particular fragments of a TGF-beta binding-protein gene can be synthesized using the 
polymerase chain reaction. 

Standard techniques for functional analysis of proteins are described by, 
for example, Treuter et al., Molec. Gen. Genet. 240: 113, 1993; Content et al., 
15 “Expression and preliminary deletion analysis of the 42 kDa 2-5A synthetase induced 
by human interferon,” in Biological Interferon Systems. Proceedings of ISIR-TNO 
Meeting on Interferon Systems, Cantell (ed.), pages 65-72 (Nijhoff 1987); Herschman, 
“The EGF Receptor,” in Control of Animal Cell Proliferation, Vol. /, Boynton et al ., 
(eds.) pages 169-199 (Academic Press 1985); Coumailleau et al., J. Biol. Chem. 
20 2“0:2927O, 1995; Fukunaga et al., Biol. Chem. 270:25291, 1995; Yamaguchi et al., 

Biochetn. Pharmacol. 50: 1295, 1995; and Meisel et al., Plant Molec. Biol. 50:1, 1996. 

The present invention also contemplates functional fragments. of a TGF- 
beta binding-protein gene that have conservative amino acid changes. 

A TGF-beta binding-protein variant gene can be identified on the basis 
25 of structure by determining the level of identity with nucleotide and amino acid 
sequences of SEQ ID NOs: 1, 5, 9, 11, 13, or, 15 and 2, 6, 10, 12, 14, or 16, as 
discussed above. An alternative approach to identifying a variant gene on the basis of 
structure is to determine whether a nucleic acid molecule encoding a potential variant 
TGF-beta binding-protein gene can hybridize under stringent conditions to a nucleic 
30 acid molecule having the nucleotide sequence of SEQ ID Nos: 1, 5, 9, 1 1, 13, or, 15, or 
a portion thereof of at least 15 or 20 nucleotides in length. As an illustration of 
stringent hybridization conditions, a nucleic acid molecule having a variant TGF-beta 
binding-protein sequence can bind with a fragment of a nucleic acid molecule having a 
sequence from SEQ ID NO: 1 in a buffer containing, for example, 5xSSPE (IxSSPE = 
35 180 mM sodium chloride, 10 mM sodium phosphate, 1 mM EDTA (pH 7.7), 

5xDenhardt’s solution (lOOxDenhardt’s = 2% (w/v) bovine serum albumin, 2% (w/v) 
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Ficoll, 2% (w/v) polyvinylpyrrolidone) and 0.5% SDS incubated overnight at 55-60°C. 
Post-hybridization washes at high stringency are typically performed in 0.5xSSC 
(lxSSC = 150 mM sodium chloride, 15 mM trisodium citrate) or in O.SxSSPE at 55-60 
°C. 

5 Regardless of the particular nucleotide sequence of a variant TGF-beta 

binding-protein gene, the gene encodes a polypeptide that can be characterized by its 
functional activity, or by the ability to bind specifically to an anti-TGF-beta binding- 
protein antibody. More specifically, variant TGF-beta binding-protein genes encode 
polypeptides which exhibit at least 50%, and preferably, greater than 60, 70, 80 or 
10 90%, of the activity of polypeptides encoded by the human TGF-beta binding-protein 

gene described herein. 

4. Production of TGF-beta binding-protein in Cultured Cells 

To express a TGF-beta binding-protein gene, a nucleic acid molecule 
encoding the polypeptide must be operably linked to regulatory sequences that control 
15 transcriptional expression in an expression vector and then introduced into a host cell. In 
addition to transcriptional regulatory sequences, such as promoters and enhancers, 
expression vectors can include translational regulatory' sequences and a marker gene which 
is suitable for selection of cells that carry the expression vector. 

Expression vectors that are suitable for production of a foreign protein in 
20 eukaryotic cells typically contain (1) prokaryotic DNA elements coding for a bacterial 
replication origin and an antibiotic resistance marker to provide for the growth and selection 
of the expression vector in a bacterial host; (2) eukaryotic DNA elements that control 
initiation of transcription, such as a promoter, and (3) DNA elements that control the 
processing of transcripts, such as a transcription termination/polyadenylation sequence. 

25 TGF-beta binding-proteins of the present invention are preferably expressed 

in mammalian cells. Examples of mammalian host cells include African green monkey 
kidney cells (Vero; ATCC CRL 1587), human embryonic kidney cells (293-HEK; ATCC 
CRL 1573), baby hamster kidney cells (BHK-21; ATCC CRL 8544), canine kidney cells 
(MDC.K; ATCC CCL 34), Chinese hamster ovary cells (CHO-K1, ATCC CCL61), rat 
30 pituitary cells (GH1, ATCC CCL 82), HeLa S3 cells (ATCC CCL2.2), rat hepatoma cells 
(H-4-I1-E; ATCC CRL 1548) SV40-transformed monkey kidney cells (COS-1; ATCC CRL 
1650) and murine embryonic cells (N1H-3T3; ATCC CRL 1658). 

For a mammalian host, the transcriptional and translational regulatory 
signals may be derived from viral sources, such as adenovirus, bovine papilloma virus, 
35 simian virus, or the like, in which the regulatory signals are associated with a particular gene 




WO 00/32773 



PCT/US99/27990 



25 



which has a high level of expression. Suitable transcriptional and translational regulatory 
sequences also can be obtained from mammalian genes, such as actin, collagen, myosin, and 
metallothionein genes. 

Transcriptional regulatory sequences include a promoter region sufficient to 
5 direct the initiation of RNA synthesis. Suitable eukaryotic promoters include the promoter 
of the mouse metallothionein I gene [Hamer et al., ./. Molec. Appl Genet. 1 . 273, 1982], the 
TK promoter of Herpes virus [McKnight, Cell 31: 355, 1982], the SV-tO early promoter 
[Benoist et al., Nature 290:304, 1981], the Runs sarcoma virus promoter [Gorman et al., 
Proc. Nat’/ Acad Sci. USA “9:6777, 1982], the cytomegalovirus promoter [Foecking et al., 
10 Gene 45: 101, 1980], and the mouse mammary tumor virus promoter (see, generally, 
Etcheverry, “Expression of Engineered Proteins in Mammalian Cell Culture," in Protein 
Engineering: Principles and Practice, Cleland et al (eds.), pages 163-181 (John Wiley & 
Sons, Inc. 1996)). 

Alternatively, a prokaryotic promoter, such as the bacteriophage T3 RNA 
15 polymerase promoter, can be used to control TGF-beta binding-protein gene expression in 
mammalian cells if the prokaryotic promoter is regulated by a eukaryotic promoter (Zhou et 
al., Mol. Cell. Biol. 70:4529, 1990; Kaufman et al., Niicl. Acids Res. 79:4485, 1991). 

TGF-beta binding-protein genes may also be expressed in bacterial, yeast, 
insect, or plant cells. Suitable promoters that can be used to express TGF-beta binding- 
20 protein polypeptides in a prokaryotic host are well-known to those of skill in the art and 
include promoters capable of recognizing the T4, T3, Sp6 and T7 polymerases, the P R 
and P L promoters of bacteriophage lambda, the trp , recA, heat shock, lacUV'5 , tac, Ipp- 
lacSpr, phoA, and lacZ promoters of E. coli, promoters of B. subti/is , the promoters of 
the bacteriophages of Bacillus , Streptomyces promoters, the ini promoter of bacterio- 
25 phage lambda, the bla promoter of pBR322, and the CAT promoter of the chloram- 
phenicol acetyl transferase gene. Prokaryotic promoters have been reviewed by Glick, 
J. Ind. Microbiol. 7:277, 1987, Watson et al., Molecular Biology of the Gene , 4th Ed. 
(Benjamin Cummins 1987), and by Ausubel et al. (1995). 

Preferred prokaryotic hosts include E. coli and Bacillus subtilus. 
30 Suitable strains of E. coli include BL21(DE3), BL21(DE3)pLysS, BL21(DE3)pLysE, 
DH1, DH4I, DH5, DH51, DH5IF’, DH5IMCR, DH10B, DH10B/p3, DH11S, C600, 
HB101, JM10I, JM105, JM109, JM110, K38, RR1, Y1088, Y1089, CSH18, ER1451, 
and ER1647 (see, for example, Brown (Ed.), Molecular Biology Labfax (Academic 
Press 1991)). Suitable strains of Bacillus subtilus include BR151, YB886, Ml 11 9, 
35 MI120, and B170 (see, for example, Hardy, “Bacillus Cloning Methods," in DNA 

Cloning: A Practical Approach, Giover (Ed.) (IRL Press 1985)). 
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Methods for expressing proteins in prokaryotic hosts are well-known to 
those of skill in the art (see, for example, Williams et al., “Expression of foreign 
proteins in E. co/i using plasmid vectors and purification of specific polyclonal 
antibodies,” in DNA Cloning 2: Expression Systems, 2nd Edition, Glover et al. (eds), 
5 page 15 (Oxford University Press 1995); Ward et al., “Genetic Manipulation and 
Expression of Antibodies,” in Monoclonal Antibodies: Principles and Applications, 
page 137 (Wiley-Liss, Inc. 1995); and Georgiou, “Expression of Proteins in Bacteria,” 
in Protein Engineering: Principles and Practice, Cleland et al. (eds ), page 101 (John 
Wiley & Sons, Inc. 1996)). 

10 The baculovirus system provides an efficient means to introduce cloned 

TGF-beta binding-protein genes into insect cells. Suitable expression vectors are based 
upon the Autographa ca/ifornica multiple nuclear polyhedrosis virus (AcMNPV), and 
contain well-known promoters such as Drosophila heat shock protein (hsp) 70 
promoter, Autographa ca/ifornica nuclear polyhedrosis virus immediate-early gene 
15 promoter (ie-J) and the delayed early 39K promoter, baculovirus p 10 promoter, and the 
Drosophila metallothionein promoter. Suitable insect host cells include cell lines 
derived from IPLB-5’/-21, a Spodoptera frugiperda pupal ovarian cell line, such as SJ9 
(ATCC CRL 1711), .S721AE, and Sf2\ (Invitrogen Corporation; San Diego, CA), as 
well as Drosophila Schneider-2 cells. Established techniques for producing 
20 recombinant proteins in baculovirus systems are provided by Bailey et al., 
“Manipulation of Baculovirus Vectors,” in Methods in Molecular Biology , Volume 7 : 
Gene Transfer and Expression Protocols, Murray (ed), pages 147-168 (The Humana 
Press, Inc. 1991), by Patel et al., “The baculovirus expression system,” in DNA Cloning 
2: Expression Systems, 2nd Edition, Glover et at. (eds ), pages 205-244 (Oxford 
25 University Press 1995), by Ausubel (1995) at pages 16-37 to 16-57, by Richardson 
(ed ), Baculovirus Expression Protocols (The Humana Press, Inc. 1995), and by 
Lucknow, “Insect Cell Expression Technology,” in Protein Engineering: Principles 
and Practice, Cleland et al. (eds.), pages 183-218 (John Wiley & Sons, Inc. 1996). 

Promoters for expression in yeast include promoters from GAL I 
30 (galactose), PGK (phosphoglycerate kinase), ADH (alcohol dehydrogenase), AOX1 
(alcohol oxidase), HIS4 (histidinol dehydrogenase), and the like. Many yeast cloning 
vectors have been designed and are readily available. These vectors include Yip-based 
vectors, such as Ylp5, YRp vectors, such as YRpl7, YEp vectors such as YEpl3 and 
YCp vectors, such as YCpl9. One skilled in the art will appreciate that there are a 
35 wide variety of suitable vectors for expression in yeast cells. 

Expression vectors can also be introduced into plant protoplasts, intact plant 
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tissues, or isolated plant cells. General methods of culturing plant tissues are provided, for 
example, by Miki et al., “Procedures for Introducing Foreign DNA into Plants,” in Methods 
in Plant Molecular Biology > and Biotechnology , Glick et al. (eds), pages 67-88 (CRC Press, 
1993). 

5 An expression vector can be introduced into host cells using a variety of 

standard techniques including calcium phosphate transfection, liposome-mediated 
transfection, microprojectile-mediated delivery, electroporation, and the like Preferably, 
the transfected cells are selected and propagated to provide recombinant host cells that 
comprise the expression vector stably integrated in the host cell genome. Techniques for 
10 introducing vectors into eukaryotic cells and techniques for selecting such stable 
transformants using a dominant selectable marker are described, for example, by Ausubel 
(1995) and by Murray (ed.), Gene Transfer and Expression Protocols (Humana Press 
1991). Methods for introducing expression vectors into bacterial, yeast, insect, and plant 
cells are also provided by Ausubel (1995). 

15 General methods for expressing and recovering foreign protein produced by 

a mammalian cell system is provided by, for example, Etcheverry, “Expression of 
Engineered Proteins in Mammalian Cell Culture,” in Protein Engineering: Principles and. 
Practice , Cleland et al. (eds ), pages 163 (Wiley-Liss, Inc. 1996). Standard techniques for 
recovering protein produced by a bacterial system is provided by, for example, 
20 Grisshammer et al. , “Purification of over-produced proteins from E. coli cells,” in DNA 
Cloning 2: Expression Systems , 2nd Edition , Glover et al. (eds ), pages 59-92 (Oxford 
University Press 1995). Established methods for isolating recombinant proteins from a 
baculovirus system are described by Richardson (ed ), Bcicii/ovirns Expression 
Protocols '{Thz Humana Press, Inc., 1995). 

25 More generally, TGF-beta binding-protein can be isolated by standard 

techniques, such as affinity chromatography, size exclusion chromatography, ion 
exchange chromatography, HPLC and the like. Additional variations in TGF-beta 
binding-protein isolation and purification can be devised by those of skill in the art. 
For example, anti-TGF-beta binding-protein antibodies, obtained as described below, 
30 can be used to isolate large quantities of protein by immunoaffinity purification. 

5. Production of Antibodies to TGF-beta binding-proteins 

Antibodies to TGF-beta binding-protein can be obtained, for example, 
using the product of an expression vector as an antigen. Particularly useful anti-TGF- 
beta binding-protein antibodies “bind specifically” with TGF-beta binding-protein of 
35 Sequence ED Nos. 2, 6, 10, 12, 14, or 16, but not to other TGF-beta binding-proteisn 
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such as Dan, Cerberus, SCGF, or Gremlin. Antibodies of the present invention 
(including fragments and derivatives thereof) may be a polyclonal or, especially a 
monoclonal antibody. The antibody may belong to any immunoglobulin class, and may be 
for example an IgG, for example IgG,, IgG 2 , IgG 3 , IgG 4 , IgE; IgM, or IgA antibody. It 
5 may be of animal, for example mammalian origin, and may be for example a murine, rat, 
human or other primate antibody. Where desired the antibody may be an internalising 
antibody. 

Polyclonal antibodies to recombinant TGF-beta binding-protein can be 
prepared using methods well-known to those of skill in the art (see, for example. Green 
10 et a!., ‘‘Production of Polyclonal Antisera,” in Immunochemical Protocols (Manson, 
ed.), pages 1-5 (Humana Press 1992); Williams et al., “Expression of foreign proteins 
in E. coli using plasmid vectors and purification of specific polyclonal antibodies,” in 
DNA Cloning 2: Expression Systems, 2nd Edition, Glover et al. (eds.), page 15 (Oxford 
University Press 1995)). Although polyclonal antibodies are typically raised in animals 
15 such as rats, mice, rabbits, goats, or sheep, an anti-TGF-beta binding-protein antibody 
of the present invention may also be derived from a subhuman primate antibody. 
General techniques for raising diagnostically and therapeutically useful antibodies in 
baboons may be found, for example, in Goldenberg et al., international patent 
publication No. WO 91/11465 (1991), and in Losman et al., hit. J. Cancer 46: 310, 
20 1990. 

The antibody should comprise at least a variable region domain. The 
variable region domain may be of any size or amino acid composition and will generally 
comprise at least one hypervariable amino acid sequence responsible for antigen binding 
embedded in a framework sequence. In general terms the variable (V) region domain may 
25 be any suitable arrangement of immunoglobulin heavy (V H ) and/or light (V L ) chain 
variable domains. Thus for example the V region domain may be monomeric and be a V H 
or V L domain where these are capable of independently binding antigen with acceptable 
affinity. Alternatively the V region domain may be dimeric and contain V H -V H . V H -V L , or 
V L -V L , dimers in which the V H and V L chains are non-covalently associated (abbreviated 
30 hereinafter as F v ). Where desired, however, the chains may be covalently coupled either 
directly, for example via a disulphide bond between the two variable domains, or through a 
linker, for example a peptide linker, to form a single chain domain (abbreviated hereinafter 
as scF v ). 

The variable region domain may be any naturally occuring variable domain 
35 or an engineered version thereof. By engineered version is meant a variable region domain 
which has been created using recombinant DNA engineering techniques. Such engineered 
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versions include those created for example from natural antibody variable regions by 
insertions, deletions or changes in or to the amino acid sequences of the natural antibodies. 
Particular examples of this type include those engineered variable region domains 
containing at least one CDR and optionally one or more framework amino acids from one 
5 antibody and the remainder of the variable region domain from a second antibody. 

The variable region domain may be covalently attached at a C-terminal 
amino acid to at least one other antibody domain or a fragment thereof. Thus, for example 
where a V H domain is present in the variable region domain this may be linked to an 
immunoglobulin C H 1 domain or a fragment thereof. Similarly a V L domain may be linked 
10 to a C K domain or a fragment thereof. In this way for example the antibody may be a Fab 
fragment wherein the antigen binding domain contains associated V H and V L domains 
covalently linked at their C-termini to a CHI and C K domain respectively. The CHI 
domain may be extended with further amino acids, for example to provide a hinge region 
domain as found in a Fab' fragment, or to provide further domains, such as antibody CH2 
15 and CH3 domains. 

Another form of an antibody fragment is a peptide coding for a single 
complementarity-determining region (CDR). CDR peptides (“minimal recognition 
units”) can be obtained by constructing genes encoding the CDR of an antibody of 
interest. Such genes are prepared, for example, by using the polymerase chain reaction 
20 to synthesize the variable region from RNA of antibody-producing cells (see, for 
example, Larrick et al., Methods: A Companion to Methods in Enzymology 2: 106, 
1991; Courtenay-Luck, “Genetic Manipulation of Monoclonal Antibodies,” in 
Monoclonal Antibodies: Production, Engineering and. Clinical Application, Ritter et al. 
(eds.), page 166 (Cambridge University. Press 1995); and Ward et al., “Genetic 
25 Manipulation and Expression of Antibodies,” in Monoclonal Antibodies: Principles 
and Applications, Birch et al., (eds.), page 137 (Wiley-Liss, Inc. 1995)). 

Antibodies for use in the invention may in general be monoclonal (prepared 
by conventional immunisation and cell fusion procedures) or in the case of fragments, 
derived therefrom using any suitable standard chemical e.g. reduction or enzymatic cleavage 
30 and/or digestion techniques, for example by treatment with pepsin. 

More specifically, monoclonal anti-TGF-beta binding-protein antibodies 
can be generated utilizing a variety of techniques. Rodent monoclonal antibodies to 
specific antigens may be obtained by methods known to those skilled in the art (see, for 
example, Kohler et al„ Nature 256:495, 1975; and Coligan et al. (eds ), Current 
35 Protocols in Immunology, 1:2. 5. 1-2. 6.7 (John Wiley & Sons 1991) [“Coligan”]; 
Picksley et al., “Production of monoclonal antibodies against proteins expressed in E. 
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coll'' in DNA Cloning 2: Expression Systems, 2nd Edition , Glover et al. (eds.), page 93 
(Oxford University Press 1995)). 

Briefly, monoclonal antibodies can be obtained by injecting mice with a 
composition comprising a TGF-beta binding-protein gene product, verifying the 
5 presence of antibody production by removing a serum sample, removing the spleen to 
obtain B-lymphocytes, fusing the B-lymphocytes with myeloma cells to produce 
hybridomas, cloning the hybridomas, selecting positive clones which produce 
antibodies to the antigen, culturing the clones that produce antibodies to the antigen, 
and isolating the antibodies from the hybridoma cultures. 

10 In addition, an anti-TGF-beta binding-protein antibody of the present 

invention may be derived from a human monoclonal antibody. Human monoclonal 
antibodies are obtained from transgenic mice that have been engineered to produce specific 
human antibodies in response to antigenic challenge. In this technique, elements of the 
human heavy and light chain locus are introduced into strains of mice derived from 
15 embryonic stem cell lines that contain targeted disruptions of the endogenous heavy chain 
and light chain loci. The transgenic mice can synthesize human antibodies specific for 
human antigens, and the mice can be used to produce human antibody-secreting 
hybridomas. Methods for obtaining human antibodies from transgenic mice are described, 
for example, by Green et al., Nature Genet. 7:13, 1994; Lonberg et al„ Nature 365:856, 
20 1994; and Taylor et al., Int. Immun. 6: 579, 1994. 

Monoclonal antibodies can be isolated and purified from hybridoma 
cultures by a variety of well-established techniques. Such isolation techniques include 
affinity chromatography with Protein-A Sepharose, size-exclusion chromatography, 
and ion-exchange chromatography (see, for example, Coligan at pages 2.7.1-2.7.12 and 
25 pages 2.9. 1 -2.9.3; Baines et al., “Purification of Immunoglobulin G (IgG).” in Methods 
in Molecular Biology, Vol. JO, pages 79-104 (The Humana Press, Inc. 1992)). 

For particular uses, it may be desirable to prepare fragments of anti- 
TGF-beta binding-protein antibodies. Such antibody fragments can be obtained, for 
example, by proteolytic hydrolysis of the antibody. Antibody fragments can be 
30 obtained by pepsin or papain digestion of whole antibodies by conventional methods. 
As an illustration, antibody fragments can be produced by enzymatic cleavage of 
antibodies with pepsin to provide a 5S fragment denoted F(ab’)-,. This fragment can be 
further cleaved using a thiol reducing agent to produce 3.5S Fab’ monovalent 
fragments. Optionally, the cleavage reaction can be performed using a blocking group 
35 for the sulfhydryl groups that result from cleavage of disulfide linkages. As an ■ 
alternative, an enzymatic cleavage using pepsin produces two monovalent Fab 
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fragments and an Fc fragment directly. These methods are described, for example, by 
Goldenberg, U S. patent No. 4,331,647, Nisonoff et al., Arch Biochew. Biophys. 
89\ 230, 1960, Porter, Biochem. ./. 75:119, 1959, Edelman et al., in Methods in 
Enzymology 7:422 (Academic Press 1967), and by Coligan at pages 2.8.1-2.8.10 and 
5 2.10.-2.10.4. 

Other methods of cleaving antibodies, such as separation of heavy chains 
to form monovalent light-heavy chain fragments, further cleavage of fragments, or 
other enzymatic, chemical or genetic techniques may also be used, so long as the 
fragments bind to the antigen that is recognized by the intact antibody. 

10 Alternatively, the antibody may be. a recombinant or engineered antibody 

obtained by the use of recombinant DNA techniques involving the manipulation and re- 
expression of DNA encoding antibody variable and/or constant regions. Such DNA is 
known and/or is readily available from DNA libraries including for example phage- 
antibody libraries (see Chiswell, D J and McCafferty, J. Tibtech. JO 80-84 (1992)) or where 
15 desired can be synthesised. Standard molecular biology and/or chemistry procedures may 
be used to sequence and manipulate the DNA, for example, to introduce codons to create 
cysteine residues, to modify, add or delete other amino acids or domains as desired. 

From here, one or more replicable expression vectors containing the DNA 
may be prepared and used to transform an appropriate cell line, e.g. a non-producing 
20 myeloma cell line, such as a mouse NSO line or a bacterial, e.g. E.coli line, in which 
production of the antibody will occur. In order to obtain efficient transcription and 
translation, the DNA sequence in each vector should include appropriate regulatory 
sequences, particularly a promoter and leader sequence operably linked to the variable 
domain sequence. Particular methods for producing antibodies in this way are generally 
25 well known and routinely used. For example, basic molecular biology procedures are 
described by Maniatis et a! (Molecular Cloning, Cold Spring Harbor Laboratory, New 
York, 1989); DNA sequencing can be performed as described in Sanger et al (PNAS 74, 
5463, (1977)) and the Amersham International pic sequencing handbook; and site directed 
mutagenesis can be carried out according to the method of Kramer el a! (Nucl. Acids Res. 
30 J2, 9441, (1984)) and the Anglian Biotechnology Ltd handbook. Additionally, there are 

numerous publications, detailing techniques suitable for the preparation of antibodies by 
manipulation of DNA, creation of expression vectors and transformation of appropriate 
cells, for example as reviewed by Mountain A and Adair, J R in Biotechnology and Genetic 
Engineering Reviews (ed. Tombs, M P, JO, Chapter 1, 1992, Intercept, Andover, UK) and 
35 in International Patent Specification No. WO 91/09967. 

Where desired, the antibody according to the invention may have one or 
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more effector or reporter molecules attached to it and the invention extends to such 
modified proteins. The effector or reporter molecules may be attached to the antibody 
through any available amino acid side-chain, terminal amino acid or, where present 
carbohydrate functional group located in the antibody, always provided of course that this 
5 does not adversely affect the binding properties and eventual usefulness of the molecule. 
Particular functional groups include, for example any free amino, imino, thiol, hydroxyl, 
carboxyl or aldehyde group. Attachment of the antibody and the effector and/or reporter 
molecule(s) may be achieved via such groups and an appropriate functional group in the 
effector or reporter molecules. The linkage may be direct or indirect, through spacing or 
10 bridging groups. 

Effector molecules include, for example, antineoplastic agents, toxins (such 
as enzymatically active toxins of bacterial or plant origin and fragments thereof e.g. ricin 
and fragments thereof) biologically active proteins, for example enzymes, nucleic acids and 
fragments thereof, e g. DNA, RNA and fragments thereof, naturally occurring and synthetic 
15 polymers e.g. polysaccharides and polyalkylene polymers such as polyethylene glycol) and 
derivatives thereof, radionuclides, particularly radioiodide, and chelated metals. Suitable 
reporter groups include chelated metals, fluorescent compounds or compounds which may 
be detected by NMR or ESR spectroscopy. 

Particular antineoplastic agents include cytotoxic and cytostatic agents, for 
20 example alkylating agents, such as nitrogen mustards (e.g. chlorambucil, melphalan, 
mechlorethamine, cyclophosphamide, or uracil mustard) and derivatives thereof, 
triethylenephosphoramide, triethylenethiophosphor-amide, = busulphan, or . cisplatin, 
antimetabolites, such as methotrexate, fluorouracil, floxuridine, cytarabine, mercaptopurine, 
thioguanine, fluoroacetic acid or fluorocitric acid, antibiotics, such as bleomycins (e.g. 
25 bleomycin sulphate), doxorubicin, daunorubicin, mitomycins (e.g. mitomycin C), 
actinomycins (e.g. dactinomycin) plicamycin, calichaemicin and derivatives thereof, or 
esperamicin and derivatives thereof; mitotic inhibitors, such as etoposide, vincristine or 
vinblastine and derivatives thereof; alkaloids, such as ellipticine; polyols such as taxicin-I or 
taxicin-II; hormones, such as androgens (e.g. dromostanolone or testolactone), progestins 
30 (e.g. megestrol acetate or medroxyprogesterone acetate), estrogens (e.g. dimethylstilbestrol 

diphosphate, polyestradiol phosphate or estramustine phosphate) or antiestrogens (e.g. 
tamoxifen); anthraquinones, such as mitoxantrone, ureas, such as hydroxyurea; hydrazines, 
such as procarbazine; or imidazoles, such as dacarbazine. 

Particularly useful effector groups are calichaemicin and derivatives thereof 
35 (see for example South African Patent Specifications Nos. 85/8794, 88/8 1 27 and 90/2839), 

Chelated metals include chelates of di-or tripositive metals having a 
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coordination number from 2 to 8 inclusive. Particular examples of such metals include 
technetium (Tc), rhenium (Re), cobalt (Co), copper (Cu), gold (Au), silver (Ag), lead (Pb), 
bismuth (Bi), indium (In), gallium (Ga), yttrium (Y), terbium (Tb), gadolinium (Gd), and 
scandium (Sc). In general the metal is preferably a radionuclide. Particular radionuclides 
5 include ""'Tc, "V “V 58 Co, “Co. 67 Cu, l,5 Au, "V 203 Pb, “si, 24, Bi. 
1 1 'in, 67 Ga, '"'Ga, “y, W Y, 160 Tb, l53 0d and 47 Sc. 

The chelated metal may be for example one of the above types of metal 
chelated with any suitable polydentate chelating agent, for example acyclic or cyclic 
polyamines, polyethers, (e.g. crown ethers; and derivatives thereof); polyamides; porphyrins; 
10 and carbocyclic derivatives. 

In general, the type of chelating agent will depend on the metal in use. One 
particularly useful group of chelating agents in conjugates according to the invention, 
however, are acyclic and cyclic polyamines, especially polyaminocarboxylic acids, for 
example diethylenetriaminepentaacetic acid and derivatives thereof, and macrocyclic 
15 amines, e.g. cyclic tri-aza and tetra-aza derivatives (for example as described in 
International Patent Specification No. WO 92/22583); and polyamides, especially 
desferrioxamine and derivatives thereof. 

Thus for example w r hen it is desired to use a thiol group in the antibody as 
the point of attachment this may be achieved through reaction with a thiol reactive group 
20 present in the effector or reporter molecule. Examples of such groups include an a- 
halocarboxylic acid or ester, e.g. iodoacetamide, an imide, e.g. maleimide, a vinyl sulphone, 
or a disulphide. These and other suitable linking procedures are generally and more 
particularly described in International Patent Specifications Nos. WO 93/06231, WO 
92/22583, WO 90/091 195 and WO 89/01476. 



25 Assays for Selecting Molecules Which Increase Bone Density 

As discussed above, the present invention provides methods for selecting 
and/or isolating compounds which are capable of increasing bone density. For example, 
within one aspect of the present invention methods are provided for determining 
whether a selected molecule is capable of increasing bone mineral content, comprising 
30 the steps of (a) mixing a selected molecule with TGF-beta binding protein and a 
selected member of the TGF-beta family of proteins, (b) determining whether the 
selected molecule stimulates signaling by the TGF-beta family of proteins, or inhibits 
the binding of the TGF-beta binding protein to the TGF-beta family of proteins. 
Within certain embodiments, the molecule enhances the ability of TGF-beta to 
35 function as a positive regulator of mesenchymal cell differentiation. 
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Within other aspects of the invention, methods are provided for 
determining whether a selected molecule is capable of increasing bone mineral content, 
comprising the steps of (a) exposing a selected molecule to cells which express TGF- 
beta binding-protein and (b) determining whether the expression (or activity) of TGF- 
5 beta binding-protein from said exposed cells decreases, and therefrom determining 
whether the compound is capable of increasing bone mineral content. Within one 
embodiment, the cells are selected from the group consisting of the spontaneously 
transformed or untransformed normal human bone from bone biopsies and rat parietal 
bone osteoblasts. Such methods may be accomplished in a wide variety of assay 
10 formats including, for example, Countercurrent Immuno-Electrophoresis (CIEP), 
Radioimmunoassays, Radioimmunoprecipitations, Enzyme-Linked Immuno-Sorbent 
Assays (ELISA), Dot Blot assays. Inhibition or Competition assays, and sandwich 
assays (see U S. Patent Nos. 4,376,1 10 and 4,486,530; see also Antibodies: A 
Laboratory Manual, supra). 

15 Representative embodiments of such assays are provided below in 

Examples 5 and 6. Briefly, a family member of the TGF-beta super-family or a TGF- 
beta binding protein is first bound to a solid phase, followed by addition of a candidate 
molecule. The labeled family member of the TGF-beta super-family or a TGF-beta 
binding protein is then added to the assay, the solid phase washed, and the quantity of 
20 bound or labeled TGF-beta super-family member or TGF-beta binding protein on the 
solid support determined. Molecules which are suitable for use in increasing bone 
mineral content as described herein are those molecules which decrease the binding of 
TGF-beta binding protein to a member or members of the TGF-beta super-family in a 
statistically significant manner. Obviously, assays suitable for use within the present 
25 invention should not be limited to the embodiments described within Examples 2 and 3. 
In particular, numerous parameters may be altered, such as by binding TGF-beta to a 
solid phase, or by elimination of a solid phase entirely. 

Within other aspects of the invention, methods are provided for 
determining whether a selected molecule is capable of increasing bone mineral content, 
30 comprising the steps of (a) exposing a selected molecule to cells which express TGF- 
beta and (b) determining whether the activity of TGF-beta from said exposed cells is 
altered, and therefrom determining whether the compound is capable of increasing bone 
mineral content. Similar to the above described methods, a wide variety of methods 
may be utilized to assess the changes of TGF-beta binding-protein expression due to a 
35 selected test compound. 
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For example, within one aspect of the present invention methods are 
provided for determining whether a selected molecule is capable of increasing bone 
mineral content, comprising the steps of (a) mixing a selected molecule with TGF-beta- 
binding-protein and a selected member of the TGF-beta family of proteins, 
5 (b) determining whether the selected molecule up-regulates the signaling of the TGF- 

beta family of proteins, or inhibits the binding of the TGF-beta binding-protein to the 
TGF-beta family of proteins. Within certain embodiments, the molecule enhances the 
ability of TGF-beta to function as a positive regulator of mechemchymal cell 
differentiation. 

>0 Similar to the above described methods, a wide variety of methods may 

be utilized to assess stimulation of TGF-beta due to a selected test compound. One 
such representative method is provided below in Example 6 (see also Durham etal., 
Endo. 73/5:1374-1380. 

Within yet other aspects of the present invention, methods are provided 

15 for determining whether a selected molecule is capable of increasing bone mineral 
content, comprising the step of determining whether a selected molecule inhibits the 
binding of TGF-beta binding-protein to bone, or an analogue thereof. As utilized 
herein, it should be understood that bone or analogues thereof refers to hydroxyapatite, 
or a surface composed of a powdered form of bone, crushed bone or intact bone. 

20 Similar to the above described methods, a wide variety of methods may be utilized to 
assess the inhibition of TGF-beta binding-protein localization to bone matrix. One 
, such representative method is provided below in Example 7. 

It should be noted that while the methods recited herein may refer to the 
analysis of an individual test molecule, that the present invention should not be so 

25 limited. In particular, the selected molecule may be contained within a mixture of 
compounds. Hence, the recited methods may further comprise the step of isolating a 
molecule which inhibits the binding of TGF-beta binding-protein to a TGF-beta family 
member. 

Candidate Molecules 

30 A wide variety of molecules may be assayed for their ability to inhibit 

the binding of TGF-beta binding-protein to a TGF-beta family member. Representative 
examples which are discussed in more detail below include organic molecules, proteins 
or peptides, and nucleic acid molecules. Although it should be evident from the 
discussion below that the candidate molecules described herein may be utilized in the 
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assays described herein, it should also be readily apparent that such molecules can also 
be utilized in a variety of diagnostic and therapeutic settins. 

1. Organic Molecules 

Numerous organic molecules may be assayed for their ability to inhibit 
5 the binding of TGF-beta binding-protein to a TGF-beta family member. 

For example, within one embodiment of the invention suitable organic 
molecules may be selected from either a chemical library, wherein chemicals are 
assayed individually, or from combinatorial chemical libraries where multiple 
compounds are assayed at once, then deconvoluted to determine and isolate the most 
10 active compounds. 

Representative examples of such combinatorial chemical libraries 
include those described by Agrafiotis etal., “System and method of automatically 
generating chemical compounds with desired properties,” U.S. Patent No. 5,463,564; 
Armstrong, R.W., “Synthesis of combinatorial arrays of organic compounds through 
15 the use of multiple component combinatorial array syntheses,” WO 95/02566; Baldwin, 
J J. etal., “Sulfonamide derivatives and their use,” WO 95/24186; Baldwin, J.J. etal., 
“Combinatorial dihydrobenzopyran library,” WO 95/30642; Brenner, S., “New kit for 
preparing combinatorial libraries,” WO 95/16918; Chenera, B. etal., “Preparation of 
library' of resin-bound aromatic carbocyclic compounds,” WO 95/16712; Ellman, J.A., 
20 “Solid phase and combinatorial synthesis of benzodiazepine compounds on a solid 
support,” U.S. Patent No. 5,288,514; Felder, E. et al., “Novel combinatorial compound 
libraries,” WO 95/16209; Lerner, R. et al., “Encoded combinatorial chemical libraries,” 
WO 93/20242; Pavia, M.R. etal., “A method for preparing and selecting 
pharmaceutically useful non-peptide compounds from a structurally diverse universal 
25 library',” WO 95/04277; Summerton, J.E. and D.D. Weller, “Morpholino-subunit 
combinatorial library and method,” U.S. Patent No. 5,506,337; Holmes, C., “Methods 
for the Solid Phase Synthesis of Thiazolidinones, Metathiazanones, and Derivatives 
thereof,” WO 96/00148; Phillips, G.B. and G.P. Wei, “Solid-phase Synthesis of 
Benzimidazoles,” Tet. Letters 37:4887-90, 1996; Ruhland, B. etal., “Solid-supported 
30 Combinatorial Synthesis of Structurally Diverse (3-Lactams,” ./. Amer. Chem. Soc. 
Ill: 253-4, 1996; Look, G.C. etal., “The lndentification of Cyclooxygenase- 1 
Inhibitors from 4-Thiazolidinone Combinatorial Libraries,” Bioorg and Med Chem. 
Letters <5:707-12, 1996. 
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2. Proteins and Peptides 

A wide range of proteins and peptides may likewise be utilized as 
candidate molecules for inhibitors of the binding of TGF-beta binding-protein to a 
TGF-beta family member. 

5 a. Combinatorial Peptide Libraries 

Peptide molecules which are putative inhibitors of the binding of TGF- 
beta binding-protein to a TGF-beta family member may be obtained through the 
screening of combinatorial peptide libraries. Such libraries may either be prepared by 
one of skill in the art (see e.g, U.S. Patent Nos. 4,528,266 and 4,359,535, and Patent 
10 Cooperation Treaty Publication Nos. WO 92/15679, WO 92/15677, WO 90/07862, WO 
90/02809, or purchased from commercially available sources ( e.g ., New England 
Biolabs Ph.D ™ Phage Display Peptide Library Kit). 

b. Antibodies 

Antibodies which inhibit the binding of TGF-beta bindinu-protein to a 
15 TGF-beta family member may readily be prepared given the disclosure provided 
herein. Within the context of the present invention, antibodies are understood to 
include monoclonal antibodies, polyclonal antibodies, anti-idiotypic antibodies, 
antibody fragments (e.g.. Fab, and F(ab’)2, F v variable regions, or complementarity 
determining regions). As discussed above, antibodies are understood to be specific 
20 against TGF-beta binding-protein , or against a specific TGF-beta family member, if 
they bind with a K a of greater than or equal to lO^M, preferably greater than or equal 
to 10^M, and do not bind to other TGF-beta binding-proteins, or, bind with a K a of 
less than or equal to 10^M. Furthermore, antibodies of the present invention should 
block or inhibit the binding of TGF-beta binding-protein to a TGF-beta family member. 
25 The affinity of a monoclonal antibody or binding partner, as well as 

inhibition of binding can be readily determined by one of ordinary skill in the art (see 
Scatchard, Ann, N.Y. Acad Sci. 57:660-672, 1949). 

Briefly, polyclonal antibodies may be readily generated by one of 
ordinary skill in the art from a variety of warm-blooded animals such as horses, cows, 
30 various fowl, rabbits, mice, or rats. Typically, the TGF-beta binding-protein or unique 
peptide thereof of 13-20 amino acids (preferably conjugated to keyhole limpet 
hemocyanin by cross-linking with glutaraldehyde) is utilized to immunize the animal 
through intraperitoneal, intramuscular, intraocular, or subcutaneous injections, along 
with an adjuvant such as Freund’s complete or incomplete adjuvant. Following several 
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booster immunizations, samples of serum are collected and tested for reactivity to the 
protein or peptide. Particularly preferred polyclonal antisera will give a signal on one 
of these assays that is at least three times greater than background. Once the titer of the 
animal has reached a plateau in terms of its reactivity to the protein, larger quantities of 
5 antisera may be readily obtained either by weekly bleedings, or by exsanguinating the 
animal. 

Monoclonal antibodies may also be readily generated using conventional 
techniques {see U.S. Patent Nos. RE 32,01 1, 4,902,614, 4,543,439, and 4,411,993 
which are incorporated herein by reference; see also Monoclonal Antibodies, 
10 Hyhridamas: A New Dimension in Biological Analyses, Plenum Press, Kennett, 

McKearn, and Bechtol (eds.), 1980, and Antibodies: A Laboratory Manual, Harlow and 
Lane (eds.), Cold Spring Harbor Laboratory Press, 1988, which are also incorporated 
herein by reference). 

Briefly, within one embodiment a subject animal such as a rat or mouse 
15 is immunized with TGF-beta binding-protein or portion thereof as described above. 
The protein may be admixed with an adjuvant such as Freund’s complete or incomplete 
adjuvant in order to increase the resultant immune response. Between one and three 
weeks after the initial immunization the animal may be reimmunized with another 
booster immunization, and tested for reactivity to the protein utilizing assays described 
20 above. Once the animal has reached a plateau in its reactivity to the injected protein, it 
is sacrificed, and organs which contain large numbers of B cells such as the spleen and 
lymph nodes are harvested. 

Cells which are obtained from the immunized animal may be 
immortalized by infection with a virus such as the Epstein-Barr virus (EBV) {see 
25 Glasky and Reading, Hybridoma S(4):377-389, 1989). Alternatively, within a 
preferred embodiment, the harvested spleen and/or lymph node cell suspensions are 
fused with a suitable myeloma cell in order to create a “hybridoma” which secretes 
monoclonal antibody. Suitable myeloma lines include, for example, NS-1 (ATCC No. 
TIB 18), and P3X63 - Ag 8.653 (ATCC No. CRL 1580). 

30 Following the fusion, the cells may be placed into culture plates 

containing a suitable medium, such as RPMI 1640, or DMEM (Dulbecco’s Modified 
Eagles Medium) (JRH Biosciences, Lenexa, Kansas), as well as additional ingredients, 
such as fetal bovine serum (FBS, i.e., from Hyclone, Logan, Utah, or JRH 
Biosciences). Additionally, the medium should contain a reagent which selectively 
35 allows for the growth of fused spleen and. myeloma cells such as HAT (hypoxanthine, 
aminopterin, and thymidine) (Sigma Chemical Co., St. Louis, Missouri). After about 
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seven days, the resulting fused cells or hybridomas may be screened in order to 
determine the presence of antibodies which are reactive against TGF-beta binding- 
protein (depending on the antigen used), and which block or inhibit the binding of 
TGF-beta binding-protein to a TGF-beta family member. 

^ A wide variety of assays may be utilized to determine the presence of 

antibodies which are reactive against the proteins of the present invention, including for 
example countercurrent immuno-electrophoresis, radioimmunoassays, 
radioimmunoprecipitations, enzyme-linked immuno-sorbent assays (ELISA), dot blot 
assays, western blots, immunoprecipitation, inhibition or competition assays, and 
10 sandwich assays (see U.S. Patent Nos. 4,376, 1 10 and 4,486,530; see also Antibodies: A 
Laboratory Manual , Harlow and Lane (eds ), Cold Spring Harbor Laboratory Press, 
1988). Following several clonal dilutions and reassays, a hybridoma producing 
antibodies reactive against the desired protein may be isolated. 

Other techniques may also be utilized to construct monoclonal 
15 antibodies (see William D. Huse et al., “Generation of a Large Combinational Library 
of the Immunoglobulin Repertoire in Phage Lambda,” Science 246: 1275- 1281, 
December 1989; see also L. Sastry et al., “Cloning of the Immunological Repertoire in 
Escherichia coli for Generation of Monoclonal Catalytic Antibodies: Construction of a 
Heavy Chain Variable Region-Specific cDNA Library,” Proc. Natl Acad. Sci. USA 
20 56:5728-5732, August 1989; see also Michelle Alting-Mees etal., “Monoclonal 

Antibody Expression Libraries: A Rapid Alternative to Hybridomas,” Strategies in 
Molecular Biology J:l-9, January 1990). These references describe a commercial 
system available from Stratagene (La Jolla, California) which enables the production of 
antibodies through recombinant techniques. Briefly, mRNA is isolated from a B cell 
25 population, and utilized to create heavy and light chain immunoglobulin cDNA 
expression libraries in the XlmmunoZap(H) and MmmunoZap(L) vectors. These 
vectors may be screened individually or co-expressed to form Fab fragments or 
antibodies (see Huse et al., supra ; see also Sastry et al., supra). Positive plaques may 
subsequently be converted to a non-lytic plasmid which allows high level expression of 
30 monoclonal antibody fragments from E. coli. 

Similarly, portions or fragments, such as Fab and Fv fragments, of 
antibodies may also be constructed utilizing conventional enzymatic digestion or 
recombinant DNA techniques to incorporate the variable regions of a gene which 
encodes a specifically binding antibody. Within one embodiment, the genes which 
35 encode the variable region from a hybridoma producing a monoclonal antibody of 
interest are amplified using nucleotide primers for the variable region. These primers 
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may be synthesized by one of ordinary skill in the art, or may be purchased from 
commercially available sources. Stratagene (La Jolla, California) sells primers for 
mouse and human variable regions including, among others, primers for V Ha , V^, 
Vrc> Vpjj, Cj-u, Vl and Cl regions. These primers may be utilized to amplify heavy 
5 or light chain variable regions, which may then be inserted into vectors such as 
ImmunoZAP™ H or ImmunoZAP™ L (Stratagene), respectively. These vectors may 
then be introduced into E. coli , yeast, or mammalian-based systems for expression. 
Utilizing these techniques, large amounts of a single-chain protein containing a fusion 
of the V H and V L domains may be produced (see Bird etal., Science 242\ 423-426, 
10 1988). In addition, such techniques may be utilized to change a “murine” antibody to a 

“human” antibody, without altering the binding specificity of the antibody. 

Once suitable antibodies have been obtained, they may be isolated or 
purified by many techniques well known to those of ordinary skill in the art (see 
Antibodies: A Laboratory Manna ! , Harlow and Lane (eds ), Cold Spring Harbor 
13 Laboratory Press, 1988). Suitable techniques include peptide or protein affinity 
columns, HPLC or RP-HPLC, purification on protein A or protein G columns, or any 
combination of these techniques. 

c. Mutant TGF-beta binding-proteins 

As described herein and below in the Examples (e g.. Examples 8 and 
20 9), altered versions of TGF-beta binding-protein which compete with native TGF-beta 

binding-protein’s ability to block the activity of a particular TGF-beta family member 
should lead to increased bone density. Thus, mutants of TGF-beta binding-protein 
which bind to the TGF-beta family member but do not inhibit the function of the TGF- 
beta family member would meet the criteria. The mutant versions must effectively 
25 compete with the endogenous inhibitory' functions of TGF-beta binding-protein . 

d. Production of proteins 

Although various genes (or portions thereof) have been provided herein, 
it should be understood that within the context of the present invention, reference to 
one or more of these genes includes derivatives of the genes that are substantially 
30 similar to the genes (and, where appropriate, the proteins (including peptides and 
polypeptides) that are encoded by the genes and their derivatives). As used herein, a 
nucleotide sequence is deemed to be “substantially similar” if: (a) the nucleotide 

sequence is derived from the coding region of the above-described genes and includes, 
for example, portions of the sequence or allelic variations of the sequences discussed 
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above, or alternatively, encodes a molecule which inhibits the binding of TGF-beta 
binding-protein to a member of the TGF-beta family, (b) the nucleotide sequence is 
capable of hybridization to nucleotide sequences of the present invention under 
moderate, high or very high stringency (see Sambrook etal., Molecular Cloning: A 
3 Laboratory Manual, 2nd ed., Cold Spring Harbor Laboratory Press, NY, 1989); or 
(c) the DNA sequences are degenerate as a result of the genetic code to the DNA 
sequences defined in (a) or (b). Further, the nucleic acid molecule disclosed herein 
includes both complementary' and non-complementary sequences, provided the 
sequences otherwise meet the criteria set forth herein. Within the context of the present 
10 invention, high stringency means standard hybridization conditions ( e.g ., 5XSSPE, 
0.5% SDS at 65°C, or the equivalent). 

The structure of the proteins encoded by the nucleic acid molecules 
described herein may be predicted from the primary translation products using the 
hydrophobicity plot function of, for example, P/C Gene or Intelligenetics Suite 
15 (Intelligenetics, Mountain View, California), or according to the methods described by 
Kyte and Doolittle (./. Mol. Biol 757:105-132, 1982). 

Proteins of the present invention may be prepared in the form of acidic 
or basic salts, or in neutral form. In addition, individual amino acid residues may be 
modified by oxidation or reduction. Furthermore, various substitutions, deletions, or 
20 additions may be made to the amino acid or nucleic acid sequences, the net effect of 
which is to retain or further enhance or decrease the biological activity of the mutant or 
wild-type protein. Moreover, due to degeneracy in the genetic code, for example, there 
may be considerable variation in nucleotide sequences encoding the same amino acid 
sequence. 

25 Other derivatives of the proteins disclosed herein include conjugates of 

the proteins along with other proteins or. polypeptides. This may be accomplished, for 
example, by the synthesis of N-terminal or C-terminal fusion proteins which may be 
added to facilitate purification or identification of proteins (see U S. Patent No. 
4,851,341, see also , Hopp etal., Bio/Technology <5:1204, 1988.) Alternatively, fusion 
30 proteins such as Flag/TGF-beta binding-protein be constructed in order to assist in the 
identification, expression, and analysis of the protein. 

Proteins of the present invention may be constructed using a wide 
variety of techniques described herein. Further, mutations may be introduced at 
particular loci by synthesizing oligonucleotides containing a mutant sequence, flanked 
35 by restriction sites enabling ligation to fragments of the native sequence. Following 
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ligation, the resulting reconstructed sequence encodes a derivative having the desired 
amino acid insertion, substitution, or deletion. 

Alternatively, oligonucleotide-directed site-specific (or segment 
specific) mutagenesis procedures may be employed to provide an altered gene having 
5 particular codons altered according to the substitution, deletion, or insertion required. 
Exemplary methods of making the alterations set forth above are disclosed by Walder 
etal. ( Gene 42: 133, 1986); Bauer etal. ( Gene 3 7:73, 1985); Craik ( BioTechniques , 
January 1985, 12-19); Smith etal. (Genetic Engineering: Principles and Methods 
Plenum Press, 1981); and Sambrook etal. (supra). Deletion or truncation derivatives 
to of proteins (e g., a soluble extracellular portion) may also be constructed by utilizing 
convenient restriction endonuclease sites adjacent to the desired deletion. Subsequent 
to restriction, overhangs may be filled in, and the DNA religated. Exemplary methods 
of making the alterations set forth above are disclosed by Sambrook et al. (Molecular 
Cloning: A Laboratory Manual, 2d Ed., Cold Spring Harbor Laboratory Press, 1989). 

15 Mutations which are made in the nucleic acid molecules of the present 

invention preferably preserve the reading frame of the coding sequences. Furthermore, 
the mutations will preferably not create complementary regions that could hybridize to 
produce secondary mRMA structures, such as loops or hairpins, that would adversely 
affect translation of the mRNA. Although a mutation site may be predetermined, it is 
20 not necessary that the nature of the mutation per se be predetermined. For example, in 
order to select for optimum characteristics of mutants at a given site, random 
mutagenesis may, be conducted at the target codon and the expressed mutants screened 
for indicative biological activity. Alternatively, mutations may be introduced at 
particular loci by synthesizing oligonucleotides containing a mutant sequence, flanked 
25 by restriction sites enabling ligation to fragments of the native sequence. Following 
ligation, the resulting reconstructed sequence encodes a derivative having the desired 
amino acid insertion, substitution, or deletion. 

Nucleic acid molecules which encode proteins of the present invention 
may also be constructed utilizing techniques of PCR mutagenesis, chemical 
30 mutagenesis (Drinkwater and Klinedinst, PNAS 83: 3402-3406, 1986), by forced 
nucleotide misincorporation (e.g., Liao and Wise Gene 55:107-1 11, 1990), or by use of 
randomly mutagenized oligonucleotides (Horwitz et al.. Genome 3:1 12-117, 1989). 

The present invention also provides for the manipulation and expression 
of the above described genes by culturing host cells containing a vector capable of 
35 expressing the above-described genes. Such vectors or vector constructs include either 
synthetic or cDNA-derived nucleic acid molecules encoding the desired protein, which 
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are operably linked to suitable transcriptional or translational regulatory elements. 
Suitable regulatory elements may be derived from a variety of sources, including 
bacterial, fungal, viral, mammalian, insect, or plant genes. Selection of appropriate 
regulatory elements is dependent on the host cell chosen, and may be readily 
5 accomplished by one of ordinary' skill in the art. Examples of regulatory elements 
include: a transcriptional promoter and enhancer or RNA polymerase binding 

sequence, a transcriptional terminator, and a ribosomal binding sequence, including a 
translation initiation signal. 

Nucleic acid molecules that encode any of the proteins described above 
10 may be readily expressed by a wide variety of prokaryotic and eukaryotic host cells, 
including bacterial, mammalian, yeast or other fungi, viral, insect, or plant cells. 
Methods for transforming or transfecting such cells to express foreign DNA are well 
known in the art (see, e.g ., Itakura et al., U.S. Patent No. 4,704,362; Hinnen etal., 
Pruc. Natl. Acad. Sci. USA "5:1929-1933, 1978; Murray etal., U.S. Patent 
15 No. 4,801,542; Upshall etal , U.S. Patent No. 4,935,349; Hagen etal., U S. Patent 
No. 4,784,950; Axel etal., U.S. Patent No. 4,399,216; Goeddel etal., U.S. Patent 
No. 4,766,075; and Sambrook et al. Molecular Cloning: A Laboratory Manual , 2nd 
ed., Cold Spring Harbor Laboratory Press, 1989; for plant cells see Czako and Marton, 
Plant Physiol. 707:1067-1071, 1994; and Paszkowski etal.. Biotech. 27:387-392, 
20 1992). 

Bacterial host cells suitable for carrying out the present invention 
include E. coli, B. sub/ilis , Salmonella typhinmrium, and various species within the 
genera Pseudomonas, Streptomyces, and Staphylococcus, as well as many other 
bacterial species well known to one of ordinary skill in the art. Representative 
25 examples of bacterial host cells include DH5a (Stratagene, LaJolla, California). 

Bacterial expression vectors preferably comprise a promoter which 
functions in the host cell, one or more selectable phenotypic markers, and a bacterial 
origin of replication. Representative promoters include the P-lactamase (penicillinase) 
and lactose promoter system (see Chang etal.. Nature 275: 615, 1978), the T7 RNA 
30 polymerase promoter (Studier et al., Melh. Enzymol. 755:60-89, 1990), the lambda 
promoter (Elvin et al., Gene 57:123-126, 1990), the trp promoter (Nichols and 
Yanofsky, Meih. in Enzymology 707:155, 1983) and the tac promoter (Russell etal., 
Gene 20: 231, 1982). Representative selectable markers include various antibiotic 
resistance markers such as the kanamycin or ampicillin resistance genes. Many 
35 plasmids suitable for transforming host cells are well known in the art, including among 
others, pBR322 (see Bolivar etal., Gene 2:95, 1977), the pUC plasmids pUCl8, 
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pUC19, pUC118, pUCl 19 (see Messing, Meth. in Enzymology 101: 20-77, 1983 and 
Vieira and Messing, Gene 19: 259-268, 1982), and pNH8A, pNH16a, pNHISa, and 
Bluescript Ml 3 (Stratagene, La Jolla, California). 

Yeast and fungi host cells suitable for carrying out the present invention 
5 include, among others, Saccharomyces pombe, Saccharomyces cerevisiae , the genera 
Pichia or Khiyveromyces and various species of the genus Aspergillus (McKnight et al., 
U.S. Patent No. 4,935,349). Suitable expression vectors for yeast and fungi include, 
among others, YCp50 (ATCC No. 37419) for yeast, and the amdS cloning vector pV3 
(Turnbull, Bio/Technology 7:169, 1989), YRp7 (Struhl et al., Proc. Natl. Acad. Sci. 
10 USA 7(5:1035-1039, 1978), YEpl3 (Broach etal., Gene 8: 121-133, 1979), pJDB249 
and pJDB219 (Beggs, Nature 275:104-108, 1978) and derivatives thereof. 

Preferred promoters for use in yeast include promoters from yeast 
glycolytic genes (Hitzeman etal., J. Biol. Chem. 255: 12073-12080, 1980; Albcr and 
Kawasaki, .1. Mol. Appl. Genet. 7:419-434, 1982) or alcohol dehydrogenase genes 
15 (Young et al., in Genetic Engineering of Microorganisms for Chemicals , Hollaender 
et al. (eds.), p. 355, Plenum, New York, 1982; Ammerer, Meth. Enzymol. 707:192-201, 
1983). Examples of useful promoters for fungi vectors include those derived from 
Aspergillus nidulans glycolytic genes, such as the ad.h3 promoter (McKnight etal., 
EMBO .1. 4: 2093-2099, 1985). The expression units may also include a transcriptional 
20 terminator. An example of a suitable terminator is the adh3 terminator (McKnight 
et al., ibid., 1985). 

As with bacterial vectors, the yeast vectors will generally include a 
selectable marker, which may be one of any number of genes that exhibit a dominant 
phenotype for which a phenotypic assay exists to enable transformants to be selected. 
25 Preferred selectable markers are those that complement host cell auxotrophy, provide 
antibiotic resistance or enable a cell to utilize specific carbon sources, and include leu2 
(Broach etal., ibid), ura3 (Botstein etal., Gene 8: 17, 1979), or his3 (Struhl etal., 
ibid.). Another suitable selectable marker is the cat gene, which confers 
chloramphenicol resistance on yeast cells. 

30 Techniques for transforming fungi are well known in the literature, and 

have been described, for instance, by Beggs (ibid.), Hinnen et al. (Proc. Natl. Acad. 
Sci. USA 75: 1929-1933, 1978), Yelton etal. (Proc. Natl. Acad. Sci. USA £7:1740- 
1747, 1984), and Russell (Nature 307:167-169, 1983). The genotype of the host cell 
may contain a genetic defect that is complemented by the selectable marker present on 
35 the expression vector. Choice of a particular host and selectable marker is well within 
the level of ordinary skill in the art. 
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Protocols for the transformation of yeast are also well known to those of 
ordinary skill in the art. For example, transformation may be readily accomplished 
either by preparation of spheroplasts of yeast with DNA (see Hinnen et ah, PNAS USA 
75:1929, 1978) or by treatment with alkaline salts such as LiCl (see Itoh etal., ,/. 

5 Bacteriology! 755:163, 1983). Transformation of fungi may also be carried out using 
polyethylene glycol as described by Cullen et al. (Bio/Technology 5:369, 1987). 

Viral vectors include those which comprise a promoter that directs the 
expression of an isolated nucleic acid molecule that encodes a desired protein as 
described above. A wide variety of promoters may be utilized within the context of the 
10 present invention, including for example, promoters such as MoMLV LTR, RSV LTR, 
Friend MuLV LTR, adenoviral promoter (Ohno etal., Science 2(55:781-784, 1994), 
neomycin phosphotransferase promoter/enhancer, late parvovirus promoter (Koering 
etal.. Hum. Gene Therap. 5:457-463, 1994), Herpes TK promoter, SV40 promoter, 
metallothionein 11a gene enhancer/promoter, cytomegalovirus immediate early 
15 promoter, and the cytomegalovirus immediate late promoter. Within particularly 
preferred embodiments of the invention, the promoter is a tissue-specific promoter (see 
> WO 91/02805; EP 0,415,731; and WO 90/07936). Representative examples of 
suitable tissue specific promoters include neural specific enolase promoter, platelet 
derived growth factor beta promoter, bone morphogenic protein promoter, human 
20 alphal-chimaerin promoter, synapsin 1 promoter and synapsin II promoter. In addition 
to the above-noted promoters, other viral-specific promoters (e g., retroviral promoters 
(including those noted above, as well as others such as HIV promoters), hepatitis, 
herpes (e.g., EBV), and bacterial, fungal or parasitic ( e.g. , malarial) -specific promoters 
may be utilized in order to target a specific cell or tissue which is infected with a virus, 
25 bacteria, fungus or parasite. 

Mammalian cells suitable for carrying out the present invention include, 
among others COS, CHO, SaOS, osteosarcomas, KS483, MG-63, primary osteoblasts, 
and human or mammalian bone marrow stroma. Mammalian expression vectors for use 
in carrying out the present invention will include a promoter capable of directing the 
30 transcription of a cloned gene or cDNA. Preferred promoters include viral promoters 
and cellular promoters. Bone specific promoters include the bone sialo-protein and the 
promoter for osteocalcin. Viral promoters include the cytomegalovirus immediate early 
promoter (Boshart etal., Cel! 4i. 521-530, 1985), cytomegalovirus immediate late 
promoter, SV40 promoter (Subramani etal.. Mol. Cell. Biol. /:854-864, 1981), MMTV 
35 LTR, RSV LTR, metallothionein- 1, adenovirus El a. Cellular promoters include the 
mouse metallothionein- 1 promoter (Palmiter etal., U.S. Patent No. 4,579,821), a 
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mouse V K promoter (Bergman et al., Proc. Natl Acad. Sci. USA 3/:7041-7045, 1983; 
Grant et al., Nucl. Acids Res. 15. 5496, 1987) and a mouse Vj^ promoter (Loh etal.. 
Cell 33:85-93, 1983). The choice of promoter will depend, at least in part, upon the 
level of expression desired or the recipient cell line to be transfected. 

5 Such expression vectors may also contain a set of RNA splice sites 

located downstream from the promoter and upstream from the DNA sequence encoding 
the peptide or protein of interest. Preferred RNA splice sites may be obtained from 
adenovirus and/or immunoglobulin genes. Also contained in the expression vectors is a 
polyadenylation signal located downstream of the coding sequence of interest. Suitable 
10 polyadenylation signals include the early or late polyadenylation signals from SV40 
(Kaufman and Sharp, ibid.), the polyadenylation signal from the Adenovirus 5 E1B 
region and the human growth hormone gene terminator (DeNoto et al., Nuc. Acids Res. 
9:3719-3730, 1981). The expression vectors may include a noncoding viral leader 
sequence, such as the Adenovirus 2 tripartite leader, located between the promoter and 
15 the RNA splice sites. Preferred vectors may also include enhancer sequences, such as 
the SV40 enhancer. Expression vectors may also include sequences encoding the 
adenovirus VA RNAs. Suitable expression vectors can be obtained from commercial 
sources (e.g., Stratagene, La Jolla, California). 

Vector constructs comprising cloned DNA sequences can be introduced 
20 into cultured mammalian cells by, for example, calcium phosphate-mediated 
transfection (Wigler etal., Cell 14. 725, 1978; Corsaro and Pearson, Somatic Cell 
Genetics 7:603, 1981; Graham and Van der Eb, Virology 52:456, 1973), electroporation 
(Neumann et al., EMBO J. 7:841-845, 1982), or DEAE-dextran mediated transfection 
(Ausubel et al. (eds), Current Protocols in Molecular Biology , John Wiley and Sons, 
25 Inc., NY, 1987). To identify cells that have stably integrated the cloned DNA, a 
selectable marker is generally introduced into the cells along with the gene or cDNA of 
interest Preferred selectable markers for use in cultured mammalian cells include 
genes that confer resistance to drugs, such as neomycin, hygromycin, and methotrexate. 
The selectable marker may be an amplifiable selectable marker. Preferred amplifiable 
30 selectable markers are the DHFR gene and the neomycin resistance gene. Selectable 
markers are reviewed by Thilly ( Mammalian Cell Technology , Butterworth Publishers, 
Stoneham, Massachusetts, which is incorporated herein by reference). 

Mammalian cells containing a suitable vector are allowed to grow for a 
period of time, typically 1-2 days, to begin expressing the DNA sequence(s) of interest. 
35 Drug selection is then applied to select for growth of cells that are expressing the 
selectable marker in a stable fashion. For cells that have been transfected with an 
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amplifiable, selectable marker the drug concentration may be increased in a stepwise 
manner to select for increased copy number of the cloned sequences, thereby increasing 
expression levels. Cells expressing the introduced sequences are selected and screened 
for production of the protein of interest in the desired form or at the desired level. 

5 Cells that satisfy these criteria can then be cloned and scaled up for production. 

Protocols for the transfection of mammalian cells are well known to 
those of ordinary skill in the art. Representative methods include calcium phosphate 
mediated transfection, electroporation, lipofection, retroviral, adenoviral and protoplast 
fusion-mediated transfection (see Sambrook et al., supra). Naked vector constructs can 
10 also be taken up by muscular cells or other suitable cells subsequent to injection into 
the muscle of a mammal (or other animals). 

Numerous insect host cells known in the art can also be useful within the 
present invention, in light of the subject specification. For example, the use of 
baculoviruses as vectors for expressing heterologous DNA sequences in insect cells has 
15 been reviewed by Atkinson et al. ( Pestic . Sci. 28: 21 5-224, 1990). 

Numerous plant host cells known in the art can also be useful within the 
present invention, in light of the subject specification. For example, the use of 
Agrobacterium rhizogenes as vectors for expressing genes in plant cells has been 
reviewed by Sinkar et al. (./. Biosci. (Bangalore) 77:47-58, 1987). 

20 Within related aspects of the present invention, proteins of the present 

invention may be expressed in a transgenic animal whose germ cells and somatic cells 
contain a gene which encodes the desired protein and which is operably linked to a 
promoter effective for the expression of the gene. Alternatively, in a similar manner 
transgenic animals may be prepared that lack the desired gene (e.g., “knock-out” mice). 
25 Such transgenics may be prepared in a variety of non-human animals, including mice, 
rats, rabbits, sheep, dogs, goats and pigs (see Hammer et al.. Nature 375:680-683, 
1985, Palmiter et al. Science 222:809-814, 1983, Brinster et al, Proc. Natl. Acad. Sci. 
USA 52:4438-4442, 1985, Palmiter and Brinster, Cell -77:343-345, 1985, and U.S. 
Patent Nos. 5,175,383, 5,087,571, 4,736,866, 5,387,742, 5,347,075, 5,221,778, and 
30 5,175,384). Briefly, an expression vector, including a nucleic acid molecule to be 

expressed together with appropriately positioned expression control sequences, is 
introduced into pronuclei of fertilized eggs, for example, by microinjection. Integration 
of the injected DNA is detected by blot analysis of DNA from tissue samples. It is 
preferred that the introduced DNA be incorporated into the germ line of the animal so 
35 that it is passed on to the animal’s progeny. Tissue-specific expression may be 
achieved through the use of a tissue-specific promoter, or through the use of an 
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inducible promoter, such as the metallothionein gene promoter (Palmiter etal., 1983, 
ibid), which allows regulated expression of the transgene. 

Proteins can be isolated by, among other methods, culturing suitable 
host and vector systems to produce the recombinant translation products of the present 
5 invention. Supernatants from such cell lines, or protein inclusions or whole cells where 
the protein is not excreted into the supernatant, can then be treated bv a variety of 
purification procedures in order to isolate the desired proteins. For example, the 
supernatant may be First concentrated using commercially available protein 
concentration filters, such as an Amicon or Millipore Pellicon ultrafiltration unit. 
10 Following concentration, the concentrate may be applied to a suitable purification 
matrix such as, for example, an anti-protein antibody bound to a suitable support. 
Alternatively, anion or cation exchange resins may be employed in order to purify the 
protein. As a further alternative, one or more reverse-phase high performance liquid 
chromatography (RP-HPLC) steps may be employed to further purify the protein. 
15 Other methods of isolating the proteins of the present invention are well known in the 
skill of the art. 

A protein is deemed to be “isolated” within the context of the present 
invention if no other (undesired) protein is detected pursuant to SDS-PAGE analysis 
followed by Coomassie blue staining. Within other embodiments, the desired protein 
20 can be isolated such that no other (undesired) protein is detected pursuant to SDS- 
PAGE analysis followed by silver staining. 

3. Nucleic Acid Molecules 

Within other aspects of the invention, nucleic acid molecules are 
provided which are capable of inhibiting TGF-beta binding-protein binding to a 
25 member of the TGF-beta family. For example, within one embodiment antisense 
oligonucleotide molecules are provided which specifically inhibit expression of TGF- 
beta binding-protein nucleic acid sequences (see generally , Hirashima et al. in 
Molecular Biology of RNA: New Perspectives (M. Inouye and B. S. Dudock, eds., 1987 
Academic Press, San Diego, p. 401); Oligonucleotides: Antisense Inhibitors of Gene 
30 Expression (J.S. Cohen, ed., 1989 MacMillan Press, London); Stein and Cheng, Science 
261: 1004-1 01 2, 1993; WO 95/10607; U.S. Patent No. 5,359,051; WO 92/06693; and 
EP-A2-6 12844). Briefly, such molecules are constructed such that they are 
complementary to, and able to form Watson-Crick base pairs with, a region of 
transcribed TGF-beta binding-protein mRNA sequence. The resultant double-stranded 
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nucleic acid interferes with subsequent processing of the mRNA, thereby preventing 
protein synthesis (see Example 10) 

Within other aspects of the invention, ribozymes are provided which are 
capable of inhibiting the TGF-beta binding-protein binding to a member of the TGF- 
5 beta family. As used herein, “ribozymes” are intended to include RNA molecules that 
contain anti-sense sequences for specific recognition, and an RNA-cleaving enzymatic 
activity. The catalytic strand cleaves a specific site in a target RNA at greater than 
stoichiometric concentration. A wide variety of ribozymes may be utilized within the 
context of the present invention, including for example, the hammerhead ribozyme (for 
10 example, as described by Forster and Symons, Cel! 48: 211-220, 1987; Flaseloff and 
Gerlach, Nature 328. 596-600, 1988; Walbot and Bruening, Nature 334: 196, 1988; 
Haseloff and Gerlach, Nature 334: 585, 1988); the hairpin ribozyme (for example, as 
described by Haseloff etal., U.S. Patent No. 5,254,678, issued October 19, 1993 and 
Hempel et al„ European Patent Publication No. 0 360 257, published March 26, 1990), 
15 and Tetrahymena ribosomal RNA-based ribozymes (see Cech et al., U.S. Patent No. 
4,987,071). Ribozymes of the present invention typically consist of RNA, but may also 
be composed of DNA, nucleic acid analogs ( e.g ., phosphorothioates), or chimerics 
thereof (e.g., DNA/RNA/RNA). 

4. Labels 

20 The gene product or any of the candidate molecules described above and 

below, may be labeled with a variety of compounds, including for example, fluorescent 
molecules, toxins, and radionuclides. Representative examples of fluorescent 
molecules include fluorescein, Phycohili proteins, such as phycoerythrin, rhodamine, 
Texas red and luciferase. Representative examples of toxins include ricin, abrin 
25 diphtheria toxin, cholera toxin, gelonin, pokeweed antiviral protein, tritin, Shigella 
toxin, and Pseudomonas exotoxin A. Representative examples of radionuclides include 
Cu-64, Ga-67, Ga-68, Zr-89, Ru-97, Tc-99m, Rh-105, Pd-109, ln-111, 1-123, 1-125, I- 
131, Re-186, Re-188, Au-198, Au-199, Pb-203, At-211, Pb-212 and Bi-212. In 
addition, the antibodies described above may also be labeled or conjugated to one 
30 partner of a ligand binding pair. Representative examples include avidin-biotin, and 
riboflavin-riboflavin binding protein. 

Methods for conjugating or labeling the molecules described herein with 
the representative labels set forth above may be readily accomplished by one of 
ordinary skill in the art {see Trichothecene Antibody Conjugate, U.S. Patent No. 
35 4,744,981; Antibody Conjugate, U.S. Patent No. 5,106,951; Fluorogenic Materials and 
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Labeling Techniques, U.S. Patent No. 4,018,884; Metal Radionuclide Labeled Proteins 
for Diagnosis and Therapy, U.S. Patent No. 4,897,255; and Metal Radionuclide 
Chelating Compounds for Improved Chelation Kinetics, U.S. Patent No. 4,988,496; see 
also Inman, Methods In Enzymology , Vol. 34, Affinity Techniques, Enzyme 
5 Purification: Part B , Jakoby and Wilchek (eds.). Academic Press, New York, p. 30, 
1974; see also Wilchek and Bayer, “The Avidin-Biotin Complex in Bioanalytical 
Applications,” Anal. Biochem. 777:1-32, 1988). 

Pharmaceutical Compositions 

As noted above, the present invention also provides a variety of 
10 pharmaceutical compositions, comprising one of the above-described molecules which 
inhibits the TGF -beta binding-protein binding to a member of the TGF-beta family 
along with a pharmaceutically or physiologically acceptable carrier, excipients or 
diluents. Generally, such carriers should be nontoxic to recipients at the dosages and 
concentrations employed. Ordinarily, the preparation of such compositions entails 
15 combining the therapeutic agent with buffers, antioxidants such as ascorbic acid, low 
molecular weight (less than about 10 residues) polypeptides, proteins, amino acids, 
carbohydrates including glucose, sucrose or dextrins, chelating agents such as EDTA, 
glutathione and other stabilizers and excipients. Neutral buffered saline or saline mixed 
with nonspecific serum albumin are exemplary appropriate diluents. 

20 In addition, the pharmaceutical compositions of the present invention 

may be prepared for administration by a variety of different routes. In addition, 
pharmaceutical compositions of the present invention may be placed within containers, 
along with packaging material which provides instructions regarding the use of such 
pharmaceutical compositions. Generally, such instructions will include a tangible 
25 expression describing the reagent concentration, as well as within certain embodiments, 
relative amounts of excipient ingredients or diluents (e.g., water, saline or PBS) which 
may be necessary to reconstitute the pharmaceutical composition. 

Methods of Treatment 

The present invention also provides methods for increasing the mineral 
30 content and mineral density of bone. Briefly, numerous conditions result in the loss of 
bone mineral content, including for example, disease, genetic predisposition, accidents 
which result in the lack of use of bone (e.g., due to fracture), therapeutics which effect 
bone resorption, or which kill bone forming cells and normal aging. Through use of 
the molecules described herein which inhibit the TGF-beta binding-protein binding to a 
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TGF-beta family member such conditions may be treated or prevented As utilized 
herein, it should be understood that bone mineral content has been increased, if bone 
mineral content has been increased in a statistically significant manner {e.g., greater 
than one-half standard deviation), at a selected site. 

5 A wide variety of conditions which result in loss of bone mineral content 

may be treated with the molecules described herein. Patients with such conditions may 
be identified through clinical diagnosis utilizing well known techniques (see, 
e.g., Harrison s Principles of Internal Medicine, McGraw-Hill, Inc.). Representative 
examples of diseases that may be treated included dysplasias, wherein there is abnormal 
10 growth or development of bone. Representative examples of such conditions include 
achondroplasia, cleidocranial dysostosis, enchondromatosis, fibrous dysplasia, 
Gaucher s, hypophosphatemic rickets, Marfan's, multiple hereditary exotoses, 
neurofibromatosis, osteogenesis imperfecta, osteopetrosis, osteopoikilosis, sclerotic 
lesions, fractures, periodontal disease, pseudoarthrosis and pyogenic osteomyelitis. 

15 Other conditions which may be treated or prevented include a wide 

variety of causes of osteopenia (/.<?., a condition that causes greater than one standard 
deviation of bone mineral content or density below peak skeletal mineral content at 
youth). Representative examples of such conditions include anemic states, conditions 
caused steroids, conditions caused by heparin, bone marrow disorders, scurvy, 
20 malnutrition, calcium deficiency, idiopathic osteoporosis, congenital osteopenia or 
osteoporosis, alcoholism, chronic liver disease, senility, postmenopausal state, 
oligomenorrhea , amenorrhea, pregnancy, diabetes mellitus, hyperthyroidism, 
Cushing's disease, acromegaly, hypogonadism, immobilization or disuse, reflex 
sympathetic dystrophy syndrome, transient regional osteoporosis and osteomalacia. 

25 Within one aspect of the present invention, bone mineral content or 

density may be increased by administering to a warm-blooded animal a therapeutically 
effective amount of a molecule which inhibits the TGF-beta binding-protein binding to 
a TGF-beta family member. Examples of warm-blooded animals that may be treated 
include both vertebrates and mammals, including for example horses, cows, pigs, 
30 sheep, dogs, cats, rats and mice. Representative examples of therapeutic molecules 
include ribozymes, ribozyme genes, antisense oligonucleotides and antibodies (e.g, 
humanized antibodies). 

Within other aspects of the present invention, methods are provided for 
increasing bone density, comprising the step of introducing into cells which home to 
35 bone a vector which directs the expression of a molecule which inhibits the TGF-beta 
binding-protein binding to a member of the TGF-beta family, and administering the 
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vector containing cells to a warm-blooded animal. Briefly, cells which home to bone 
may be obtained directly from the bone of patients (e.g. , cells obtained from the bone 
marrow such as CD34+, osteoblasts, osteocytes, and the like), from peripheral blood, or 
from cultures. 

5 A vector which directs the expression of a molecule that inhibits the 

TGF-beta binding-protein binding to a member of the TGF-beta family is introduced 
into the cells. Representative examples of suitable vectors include viral vectors such as 
herpes viral vectors (e.g., U S. Patent No. 5,288,641), adenoviral vectors (e.g., WO 
94/26914, WO 93/9191; Rolls et al., PNAS 91( 1) 215-219, 1994; Kass-Eisler etal., 
10 PNAS 90( 24): 1 1498-502, 1993; Guzman etal., Circulation 55(6):2838-48, 1993; 
Guzman etal., Cir. Res. 73(6): 1202- 1207, 1993; Zabner etal., Cell 75(2) .'207-216, 
1993; Li etal.. Hum Gene Thcr. 7(4):403-409, 1993; Caillaud etal.. Paw. J. Neurosci. 
5(10:1287-1291, 1993; Vincent et al., Nat. Genet. 5(2):130-134, 1993; Jaffe et al., Nat. 
Genet. /(5):372-378, 1992; and Levrero etal.. Gene 101(2): 195-202, 1991), adeno- 
15 associated viral vectors (WO 95/13365; Flotte el al,, PNAS 90(22): 106 13- 106 17, 1993), 
baculovirus vectors, parvovirus vectors (Koering et al.. Hum. Gene Therap. 5:457-463, 
1994), pox virus vectors (Panicali and Paoletti, PNAS 79:4927-4931, 1982; and Ozaki 
etal., Biochem. Biophys. Res. Comm. 193( 2):653-660, 1993), and retroviruses 
(e.g., EP 0,415,731; WO 90/07936; WO 91/0285, WO 94/03622; WO 93/25698; 
20 WO 93/25234; U.S. Patent No. 5,219,740; WO 93/11230; WO 93/10218). Viral 
vectors may likewise be constructed which contain a mixture of different elements 
(e.g., promoters, envelope sequences and the like) from different viruses, or non-viral 
sources. Within various embodiments, either the viral vector itself, or a viral particle 
which contains the viral vector may be utilized in the methods and compositions 
25 described below. 

Within other embodiments of the invention, nucleic acid molecules 
which encode a molecule which inhibits the TGF-beta binding-protein binding to a 
member of the TGF-beta family themselves may be administered by a variety of 
techniques, including, for example, administration of asialoosomucoid (ASOR) 
30 conjugated with poly-L-lysine DNA complexes (Cristano et al., PNAS 92122-92126, 
1993), DNA linked to killed adenovirus (Curiel et al.. Hum. Gene Ther. J(2): 147-154, 
1992), cytofectin-mediated introduction (DMRIE-DOPE, Vical, California), direct 
DNA injection (Acsadi etal., Nature 552:815-818, 1991); DNA ligand (Wu etal., J. of 
Biol. Chem. 264:16985-16987, 1989); lipofection (Feigner et al., Proc. Natl. Acad Sci. 
35 USA 54:7413-741 7, 1989), liposomes (Pickering etal., Circ. 59(1):13-21, 1994; and 
Wang etal., PNAS 54:7851-7855, 1987); microprojectile bombardment (Williams 
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et al., PNAS 55:2726-2730, 1991); and direct delivery of nucleic acids which encode 
the protein itself either alone (Vile and Hart, Cancer Res. 53: 3860-3864, 1993), or 
utilizing PEG-nucleic acid complexes. 

Representative examples of molecules which may be expressed by the 
5 vectors of present invention include ribozymes and antisense molecules, each of which 
are discussed in more detail above. 

Determination of increased bone mineral content may be determined 
directly through the use of X-rays (eg.. Dual Energy X-ray Absorptometry or 
“DEXA”), or by inference through bone turnover markers (osteoblast specific alkaline 
10 phosphatase, osteocalcin, type 1 procollagen C’ propeptide (PICP), and total alkaline 
phosphatase; see Cornier, C., Curr. Opin. in Rheu. 7:243, 1995), or markers of bone 
resorption (pyridinoline, deoxypryridinoline, N-telopeptide, urinary hydroxyproline, 
plasma tartrate-resistant acid phosphatases and galactosyl hydroxylysine; see Cornier, 
supra). The amount of bone mass may also be calculated from body weights, or 
15 utilizing other methods (see Guinness-Hey, Me lab. Bone Dis. and Re/. Res. 5:177-181, 
1984). 

As will be evident to one of skill in the art, the amount and frequency of 
administration will depend, of course, on such factors as the nature and severity of the 
indication being treated, the desired response, the condition of the patient, and so forth. 
20 Typically, the compositions may be administered by a variety of techniques, as noted 
above. 

The following examples are offered by way of illustration, and not by 
way of limitation. 
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EXAMPLES 
EXAMPLE 1 

SCLEROSTEOSIS MAPS TO THE LONG ARM OF HUMAN CHROMOSOME 1 7 

Genetic mapping of the defect responsible for sclerosteosis in humans 
5 localized the gene responsible for this disorder to the region of human chromosome 17 
that encodes a novel TGF-beta binding-protein family member. In sclerosteosis, 
skeletal bone displays a substantial increase in mineral density relative to that of 
unafflicted individuals. Bone in the head displays overgrowth as well. Sclerosteosis 
patients are generally healthy although they may exhibit variable degrees of syndactyly 
10 at birth and variable degrees of cranial compression and nerve compression in the skull. 

Linkage analysis of the gene detect associated with sclerosteosis was 
conducted by applying the homozygosity mapping method to DNA samples collected 
from 24 South African Afrikaaner families in which the disease occurred. (Sheffield 
et al„ 1994, Human Molecular Genetics 5: 1331-1335. “Identification of a Bardet-Biedl 
15 syndrome locus on chromosome 3 and evaluation of an efficient approach to 
homozygosity mapping”). The Afrikaaner population of South Africa is genetically 
homogeneous, the population is descended from a small number of founders who 
colonized the area several centuries ago, and it has been isolated by geographic and 
social barriers since the founding. Sclerosteosis is rare everywhere in the world outside 
20 the Afrikaaner community, which suggests that a mutation in the gene was present in 
the founding population and has since increased in numbers along with the increase .in 
the population. The use of homozygosity mapping is based on the assumption that 
DNA mapping markers adjacent to a recessive mutation are likely to be homozygous 
in affected individuals from consanguineous families and isolated populations. 

25 A set 37 1 microsatellite markers (Research Genetics, Set 6) from the 

autosomal chromosomes was selected to type pools of DNA from sclerosteosis patient 
samples. The DNA samples for this analysis came from 29 sclerosteosis patients in 24 
families, 59 unaffected family members and a set of unrelated control individuals from 
the same population. The pools consisted of 4-6 individuals, either affected 
30 individuals, affected individuals from consanguineous families, parents and unaffected 
siblings, or unrelated controls. In the pools of unrelated individuals and in most of the 
pools with affected individuals or family members analysis of the markers showed 
several allele sizes for each marker. One marker, D17S1299, showed an indication of 
homozygosity: one band in several of the pools of affected individuals. 
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All 24 sclerosteosis families were typed with a total of 19 markers in the 
region of D17S1299 (at I7ql2-q21) Affected individuals from every family were 
shown to be homozygous in this region, and 25 of the 29 individuals were homozygous 
for a core haplotype; they each had the same alleles between D17S1787 and D17S930. 

5 The other four individuals had one chromosome which matched this haplotype and a 
second which did not. In sum, the data compellingly suggested that this 3 megabase 
region contained the sclerosteosis mutation. Sequence analysis of most of the exons in 
this 3 megabase region identified a nonsense mutation in the novel TGF-beta binding- 
protein coding sequence (C>T mutation at position 1 17 of Sequence ID No. 1 results in 
10 a stop codon). This mutation was shown to be unique to sclerosteosis patients and 
carriers of Afrikaaner descent. The identity of the gene was further confirmed by 
identifying a mutation in its intron (A>T mutation at position +3 of the intron) which 
results in improper mRNA processing in a single, unrelated patient with diagnosed 
sclerosteosis. 

15 

EXAMPLE 2 

Tissue-Specificity of TGF-beta Binding-Protein Gene Expression 
A. Human Beer Gene Expression by RT-PCR: 

20 First-strand cDNA was prepared from the following total RNA samples 

using a commercially available kit (’‘Superscript Preamplification System for First- 
Strand cDNA Synthesis”, Life Technologies, Rockville, MD): human brain, human 
liver, human spleen, human thymus, human placenta, human skeletal muscle, human 
thyroid, human pituitary, human osteoblast (NHOst from Clonetics Corp., San Diego, 
25 CA), human osteosarcoma cell line (Saos-2, ATCC# HTB-85), human bone, human 
bone marrow, human cartilage, vervet monkey bone, saccharomyces cerevisiae, and 
human peripheral blood monocytes. All RNA samples were purchased from a 
commercial source (Clontech, Palo Alto, CA), except the following which were 
prepared in-house: human osteoblast, human osteosarcoma cell line, human bone, 
30 human cartilage and vervet monkey bone. These in-house RNA samples were prepared 
using a commercially available kit (“TRI Reagent”, Molecular Research Center, Inc., 
Cincinnati, OH). 

PCR was performed on these samples, and additionally on a human 
genomic sample as a control. The sense Beer oligonucleotide primer had the sequence 
35 5’-CCGGAGCTGGAGAACAACAAG-3’ (SEQ ID NO: 19). The antisense Beer 
oligonucleotide primer had the sequence 5’-GCACTGGCCGGAGCACACC-3’ (SEQ 
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ID NO:20). In addition, PCR was performed using primers for the human beta-actin 
gene, as a control. The sense beta-actin oligonucleotide primer had the sequence 5’- 
AGGCCAACCGCGAGAAGATGA CC -3’ (SEQ ID NO:21). The antisense beta-actin 
oligonucleotide primer had the sequence 5’-GAAGT CCAGGGCGACGTAGCA-3’ 
5 (SEQ ID NO:22). PCR was performed using standard conditions in 25 ul reactions, 
with an annealing temperature of 61 degrees Celsius. Thirty-two cycles of PCR were 
performed with the Beer primers and twenty-four cycles were performed with the beta- 
actin primers. 

Following amplification, 12 ul from each reaction were analyzed by 
10 agarose gel electrophoresis and ethidium bromide staining. See Figure 2A. 

B. RNA In-situ Hybridization of Mouse Embryo Sections: 

The full length mouse Beer cDNA (Sequence ID No. 1 1 ) was cloned 
into the pCR2.I vector (Invitrogen, Carlsbad, CA) in the antisense and sense direction 
15 using the manufacturer's protocol. 35 S-alpha-GTP-labeled cRNA sense and antisense 
transcripts were synthesized using in-vitro transcription reagents supplied by Ambion, 
Inc (Austin, TX). In-situ hybridization was performed according to the protocols of 
Lyons et al. (./. Cell Biol. 7/7:2427-2436, 1990). 

The mouse Beer cRNA probe detected a specific message expressed in 
20 the neural tube, limb buds, blood vessels and ossifying cartilages of developing mouse 
embryos. Panel A in Figure 3 shows expression in the apical ectodermal ridge (aer) of 
the limb (1) bud, blood vessels (bv) and the neural tube (nt). Panel B shows expression 
in the 4 th ventricle of the brain (4). Panel C shows expression in the mandible (ma) 
cervical vertebrae (cv), occipital bone (oc), palate (pa) and a blood vessel (bv). Panel 
25 D shows expression in the ribs (r) and a heart valve (va). Panel A is a transverse 
section of 10.5 dpc embryo. Panel B is a sagittal section of 12.5 dpc embryo and 
panels C and D are sagittal sections of 15.5 dpc embryos. 

ba=branchial arch, h=heart, te=telencephalon (forebrain), b=brain, f— frontonasal mass, 
g=gut, h=heart, j=jaw, li=liver, lu=lung, ot=otic vesicle, ao=, sc=spinal cord, 
30 skm=skeletal muscle, ns=nasal sinus, th=thymus , to=tongue, fl=forelimb, 
di=diaphragm 



EXAMPLE 3 

35 Expression and Purification of Recombinant Beer Protein 



A. Expression in COS-1 Cells: 
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The DNA sequence encoding the full length human Beer protein was 
amplified using the following PCR oligonucleotide primers: The 5’ oligonucleotide 
primer had the sequence 5 ’ - A AGCTTGGT ACC ATGC AGCT CCC AC-3 ’ (SEQ ID 
NO:23) and contained a Hindlll restriction enzyme site (in bold) followed by 19 
5 nucleotides of the Beer gene starting 6 base pairs prior to the presumed amino terminal 
start codon (ATG). The 3’ oligonucleotide primer had the sequence 5’- 

AAGCTTCTA CTTGTCATCGTCGTCCT TGTAGTC GTAGGCGTTCTCCAGCT-3 ’ 
(SEQ ID NO:24) and contained a Hindlll restriction enzyme site (in bold) followed by 
a reverse complement stop codon (CTA) followed by the reverse complement of the 
10 FLAG epitope (underlined, Sigma-Aldrich Co., St. Louis, MO) flanked by the reverse 
complement of nucleotides coding for the carboxy terminal 5 amino acids of the Beer. 
The PCR product was TA cloned (“Original TA Cloning Kit”, Invitrogen, Carlsbad, 
CA) and individual clones were screened by DNA sequencing. A sequence-verified 
clone was then digested by Hindlll and purified on a 1.5% agarose gel using a 
15 commercially available reagents (“QIAquick Gel Extraction Kit”, Qiagen Inc., 
Valencia, CA). This fragment was then ligated to Hindlll digested, phosphatase-treated 
pcDNA3.1 (Invitrogen, Carlsbad, CA) plasmid with T4 DNA ligase. DH10B K coli 
were transformed and plated on LB, 100 pg/ml ampicillin plates. Colonies bearing the 
desired recombinant in the proper orientation were identified by a PCR-based screen, 
20 using a 5’ primer corresponding to the T7 promoter/priming site in pcDNA3.1 and a 3’ 
primer with the sequence 5’- GCACTGGCCGGAGC AC ACC-3’ (SEQ ID NO:25) that 
corresponds to the reverse complement of internal BEER sequence. The sequence of 
the cloned fragment was confirmed by DNA sequencing. 

COS-1 cells (ATCC# CRL-1650) were used for transfection. 50 pg of 
25 the expression plasmid pcDNA-Beer-Flag was transfected, using a commercially 
available kit following protocols supplied by the manufacturer (“DEAE-Dextran 
Transfection Kit”, Sigma Chemical Co., St. Louis, MO). The final media following 
transfection was DMEM (Life Technologies, Rockville, MD) containing 0.1% Fetal 
Bovine Serum. After 4 days in culture, the media was removed. Expression of 
30 recombinant BEER was analyzed by SDS-PAGE and Western Blot using anti-FLAG 
M2 monoclonal antibody (Sigma-Aldrich Co., St. Louis, MO). Purification of 
recombinant BEER protein was performed using an anti-FLAG M2 affinity column 
(“Mammalian Transient Expression System”, Sigma-Aldrich Co., St. Louis, MO). The 
column profile was analyzed via SDS-PAGE and Western Blot using anti-FLAG M2 
35 monoclonal antibody. 
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B. Expression in SF9 insect cells: 

The human Beer gene sequence was amplified using PCR with standard 
conditions and the following primers: 

Sense primer: 5 ! -GTCGTCGGATCCATGGGGTGGCAGGCGTTCAAGAATGAT-3’ 
5 (SEQ ID N0.26) 

Antisense primer: 5 ’-GTCGTC AAGCTTCT ACTT GTC ATCGTCCTTGTAGTCGTA 
GGCGTTCTCCAGCTCGGC-3’ (SEQ ID N0.27) 

The resulting cDNA contained the coding region of Beer with two 
modifications. The N-terminal secretion signal was removed and a FLAG epitope tag 
10 (Sigma) was fused in frame to the C-terminal end of the insert. BamHl and Hindlll 
cloning sites were added and the gene was subcloned into pMelBac vector (Invitrogen) 
for transfer into a baculoviral expression vector using standard methods. 

Recombinant baculoviruses expressing Beer protein were made using the 
Bac-N-Blue transfection kit (Invitrogen) and purified according to the manufacturers 
15 instructions. 

SF9 cells (Invitrogen) were maintained in TNM_FH media (Invitrogen) 
containing 10% fetal calf serum. For protein expression, SF9 cultures in spinner flasks 
were infected at an MOI of greater than 10. Samples of the media and cells were taken 
daily for five days, and Beer expression monitored by western blot using an anti-FLAG 
20 M2 monoclonal antibody (Sigma) or an anti-Beer rabbit polyclonal antiserum. 

After five days the baculovirus-infected SF9 cells were harvested by 
centrifugation and cell associated protein was extracted from the cell pellet using a high 
salt extraction buffer (1.5 M NaCl, 50 mM Tris pH 7.5). The extract (20 ml per 300 
ml culture) was clarified by centrifugation, dialyzed three times against four liters of 
25 Tris buffered saline (150 mM NaCl, 50 mM Tris pH 7.5), and clarified by 
centrifugation again. This high salt fraction was applied to Hitrap Heparin (Pharmacia; 
5 ml bed volume), washed extensively with HEPES buffered saline (25 mM HEPES 
7.5, 150 mM Nacl) and bound proteins were eluted with a gradient from 150 mM NaCl 
to 1200 mM NaCl. Beer elution was observed at aproximately 800 mM NaCl. Beer 
30 containing fractions were supplemented to 10% glycerol and 1 mM DTT and frozen at 
-80 degrees C. 

EXAMPLE 4 

35 PREPARATION AND TESTING OF POLYCLONAL ANTIBODIES TO BEER, GREMLIN, AND 

Dan 
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A. Preparation of antigen: 

The DNA sequences of Human Beer, Human Gremlin, and Human Dan 
were amplified using standard PCR methods with the following oligonucleotide 
primers: 

5 H. Beer 

Sense: 5' -G ACTTGGAT CCC AGGGGT GGC AGGCGTTC- 3’ (SEQ ID NO:28) 
Antisense 5 1 - AGC AT AAGCTTCT AGT AGGCGTT CTCC AG- 3' (SEQ ID NO:29) 

H. Gremlin 

Sense: 5' -GACTT GGAT CCGAAGGGA AAAAGA A AGGG- 3' (SEQ ID NO:30) 

10 Antisense: 5' -AGC AT AAGCTTTT A ATCC AA ATCG AT GGA- 3' (SEQ ID NO:3 1) 

H. Dan 

Sense: 5 1 -ACTACGAGCTCGGCCCCACCACCCATCAACAAG- 3' (SEQ ID NO:32) 
Antisense: 5' -ACTTAGAAGCTTTCAGTCCTCAGCCCCCTCTTCC-3' (SEQ ID 
NO:33) 

15 In each case the listed primers amplified the entire coding region minus 

the secretion signal sequence. These include restriction sites for subcloning into the 
bacterial expression vector pQE-30 (Qiagen Inc., Valencia, CA) at sites BamHI/Hindlll 
for Beer and Gremlin, and sites Sacl/Hindlll for Dan. pQE30 contains a coding 
sequence for a 6x His tag at the 5' end of the cloning region. The completed constructs 
20 were transformed into £. coll strain M-15/pRep (Qiagen Inc) and individual clones 
verified by sequencing. Protein expression in M-I5/pRep and purification (6xHis 
affinity tag binding to Ni-NTA coupled to Sepharose) were performed as described by 
the manufacturer (Qiagen, The QIAexpressionist). 

The E. coh - derived Beer protein was recovered in -significant quantity 
25 using solubilization in 6M guanidine and dialyzed to 2-4M to prevent precipitation 
during storage. Gremlin and Dan protein were recovered in higher quantity with 
solubilization in 6M guanidine and a post purification guanidine concentration of 0.5M. 

B. Production and testing of polyclonal antibodies: 

30 Polyclonal antibodies to each of the three antigens were produced in 

rabbit and in chicken hosts using standard protocols (R & R Antibody, Stanwood, WA; 
standard protocol for rabbit immunization and antisera recovery; Short Protocols in 
Molecular Biology. 2nd edition. 1992. 11.37- 11.41. Contributors Helen M. Cooper 
and Yvonne Paterson; chicken antisera was generated with Strategic Biosolutions, 
35 Ramona, C A). 

Rabbit antisera and chicken egg Igy fraction were screened for activity 
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via Western blot. Each of the three antigens was separated by PAGE and transferred to 
0.45um nitrocellulose (Novex, San Diego, CA). The membrane was cut into strips with 
each strip containing approximately 75 ng of antigen. The strips were blocked in 3% 
Blotting Grade Block (Bio-Rad Laboratories, Hercules, CA) and washed 3 times in IX 
5 Tris buffer saline (TBS) /0.02% TWEEN buffer. The primary antibody 
(preimmunization bleeds, rabbit antisera or chicken egg IgY in dilutions ranging from 
1:100 to 1:10,000 in blocking buffer) was incubated with the strips for one hour with 
gentle rocking. A second series of three washes IX TBS/0. 02%TWEEN was followed 
by an one hour incubation with the secondary antibody (peroxidase conjugated donkey 
10 anti-rabbit, Amersham Life Science, Piscataway, NJ; or peroxidase conjugated donkey 
anti-chicken, Jackson ImmunoResearch, West Grove, PA). A final cycle of 3X washes 
of IX TBS/0.02%TWEEN was performed and the strips were developed with Lumi- 
Light Western Blotting Substrate (Roche Molecular Biochemicals, Mannheim, 
Germany). 

15 

C. Antibody cross-reactivity test: 

Following the protocol described in the previous section, nitrocellulose 
strips of Beer, Gremlin or Dan were incubated with dilutions (1:5000 and 1 : 10,000) of 
their respective rabbit antisera or chicken egg IgY as well as to antisera or chicken egg 
20 Igy (dilutions 1:1000 and 1:5000) made to the remaining two antigens. The increased 
levels of nonmatching antibodies was performed to detect low affinity binding by those 
antibodies that may be seen only at increased concentration. The protocol and duration 
of development is the same for all three binding events using the protocol described 
above. There was no antigen cross-reactivity observed for any of the antigens tested. 

25 

EXAMPLE 5 

INTERACTION OF BEER WITH TGF-BETA SUPER-FAMILY PROTEINS 

The interaction of Beer with proteins from different phylogenetic arms 
30 of the TGF-P superfamily were studied using immunoprecipitation methods. Purified 
TGFP-1, TGFP~2, TGFp-3, BMP-4, BMP-5, BMP-6 and GDNF were obtained from 
commerical sources (R&D systems; Minneapolis, MN). A representative protocol is as 
follows. Partially purified Beer was dialyzed into HEPES buffered saline (25 mM 
HEPES 7.5, 150 mM NaCl). Immunoprecipitations were done in 300 ul of IP buffer 
35 (150 mM NaCl, 25 mM Tris pH 7.5, ImM EDTA, 1.4 mM P-mercaptoethanol, 0.5 % 

triton X 100, and 10% glycerol). 30 ng recombinant human BMP-5 protein (R&D 
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systems) was applied to 15 ul of FLAG affinity matrix (Sigma; St Louis MO)) in the 
presence and absence of 500 ng FLAG epitope-tagged Beer. The proteins were 
incubated for 4 hours @ 4°Cand then the affinity matrix-associated proteins were 
washed 5 times in IP buffer (1 ml per wash). The bound proteins were eluted from the 
5 affinity matrix in 60 microliters of IX SDS PAGE sample buffer. The proteins were 
resolved by SDS PAGE and Beer associated BMP-5 was detected by western blot using 
anti-BMP-5 antiserum (Research Diagnostics, Inc) (see Figure 5). 

BEER Ligand Binding Assay: 

10 FLAG-Beer protein (20 ng) is added to 100 ul PBS/0.2% BSA and 

adsorbed into each well of 96 well microtiter plate previously coated with anti-FLAG 
monoclonal antibody (Sigma; St Louis MO) and blocked with 10% BSA in PBS This 
is conducted at room temperature for 60 minutes. This protein solution is removed and 
the wells are washed to remove unbound protein. BMP-5 is added to each well in 
15 concentrations ranging from 10 pM to 500 nM in PBS/0.2% BSA and incubated for 2 
hours at room temperature. The binding solution is removed and the plate washed with 
three times with 200ul volumes of PBS/0.2% BSA. BMP-5 levels are then detected 
using BMP-5 anti-serum via ELISA (F.M. Ausubel et al (1998) Current Protocols in 
Mol Biol. Vol 2 11.2.1-11.2.22). Specific binding is calculated by subtracting non- 
20 specific binding from total binding and analyzed by the LIGAND program (Munson 
and Podbard, Anal. Biochem., 107, p220-239, (1980). 

In a variation of this method, Beer is engineered and expressed as a 
human Fc fusion protein. Likewise the ligand BMP is engineered and expressed as 
mouse Fc fusion. These proteins are incubated together and the assay conducted as 
25 described by Mellor et al using homogeneous time resolved fluorescence detection 
(GW. Mellor et al., J o/Biomo/ Screening, 3(2) 91-99, 1998). 

EXAMPLE 6 

30 Screening Assay for Inhibition of TGF-beta binding-protein 

Binding to TGF-beta Family Members 
The assay described above is replicated with two exceptions. First, 
BMP concentration is held fixed at the Kd determined previously. Second, a collection 
of antagonist candidates is added at a fixed concentration (20 uM in the case of the 
35 small organic molecule collections and 1 uM in antibody, studies). These candidate 
molecules (antagonists) of TGF-beta binding-protein binding include organic 
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compounds derived from commercial or internal collections representing diverse 
chemical structures. These compounds are prepared as stock solutions in DMSO and 
are added to assay wells at < 1% of final volume under the standard assay conditions. 
These are incubated for 2 hours at room temperature with the BMP and Beer, the 
5 solution removed and the bound BMP is quantitated as described. Agents that inhibit 
40% of the BMP binding observed in the absence of compound or antibody are 
considered antagonists of this interaction. These are further evaluated as potential 
inhibitors based on titration studies to determine their inhibition constants and their 
influence on TGF-beta binding-protein binding affinity. Comparable specificity 
10 control assays may also be conducted to establish the selectivity profile for the 
identified antagonist through studies using assays dependent on the BMP ligand action 
(e.g. BMP/BMP receptor competition study). 

15 EXAMPLE 7 

Inhibition of TGF-beta bending-protein localization to Bone Matrix 

Evaluation of inhibition of localization to bone matrix (hydroxyapatite) 
is conducted using modifications to the method of Nicolas (Nicolas, V. Ca/cif Tissue 
lnt 5/: 206, 1995). Briefly, '‘^-labelled TGF-beta binding-protein is prepared as 
20 described by Nicolas (supra). Hydroxyapatite is added to each well of a 96 well 
microtiter plate equipped with a polypropylene filtration membrane (Polyfiltroninc, 
Weymouth MA). TGF-beta binding-protein is added to 0.2% albumin in PBS buffer. 
The wells containing matrix are washed 3 times with this buffer. Adsorbed TGF-beta 
binding-protein is eluted using 0.3M NaOH and quantitated. 

25 . Inhibitor identification is conducted via incubation of TGF-beta binding- 

protein with test molecules and applying the mixture to the matrix as described above. 
The matrix is washed 3 times with 0.2% albumin in PBS buffer. Adsorbed TGF-beta 
binding-protein is eluted using 0.3 M NaOH and quantitated. Agents that inhibit 40% 
of the TGF-beta binding-protein binding observed in the absence of compound or 
30 antibody are considered bone localization inhibitors. These inhibitors are further 
characterized through dose response studies to determine their inhibition constants and 
their influence on TGF-beta binding-protein binding affinity. 
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EXAMPLE 8 

Construction of TGF-beta binding-protein Mutant 

A. Mutagenesis: 

A full-length TGF-beta binding-protein cDNA in pBluescript SK serves 
5 as a template for mutagenesis. Briefly, appropriate primers (see the discussion 
provided above) are utilized to generate the DNA fragment by polymerase chain 
reaction using Vent DNA polymerase (New England Biolabs, Beverly, MA). The 
polymerase chain reaction is run for 23 cycles in buffers provided by the manufacturer 
using a 57°C annealing temperature. The product is then exposed to two restriction 
10 enzymes and after isolation using agarose gel electrophoresis, ligated back into pRBP4- 
503 from which the matching sequence has been removed by enzymatic digestion. 
Integrity of the mutant is verified by DNA sequencing. 

B. Mammalian Cell Expression and Isolation of Mutant TGF-beta binding-protein: 

15 The mutant TGF-beta binding-protein cDNAs are transferred into the 

pcDNA3. 1 mammalian expression vector described in EXAMPLE 3. After verifying 
the sequence, the resultant constructs are transfected into COS-1 cells, and secreted 
protein is purified as described in EXAMPLE 3. 



20 

EXAMPLE 9 
Animal Models -I 

Generation of Transgenic Mice Overexpressing the Beer Gene 

The -200 kilobase (kb) BAC clone 1 5G5, isolated from the CITB mouse 
25 genomic DNA library (distributed by Research Genetics, Huntsville, AJL) was used to 
determine the complete sequence of the mouse Beer gene and its 5' and 3' flanking 
regions. A 41 kb Sail fragment, containing the entire gene body, plus -17 kb of 5' 
flanking and —20 kb of 3' flanking sequence was sub-cloned into the BamHI site of the 
SuperCosI cosmid vector (Stratagene, La Jolla, CA) and propagated in the E. coli strain 
30 DH10B. From this cosmid construct, a 35 kb Mlul - Avill restriction fragment 
(Sequence No. 6), including the entire mouse Beer gene, as well as 17 kb and 14 kb of 
5' and 3' flanking sequence, respectively, was then gel purified, using conventional 
means, and used for microinjection of mouse zygotes (DNX Transgenics; US Patent 
No. 4,873,191). Founder animals in which the cloned DNA fragment was integrated 
35 randomly into the genome were obtained at a frequency of 5-30% of live-born pups. 
The presence of the transgene was ascertained by performing Southern blot analysis of 
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genomic DNA extracted from a small amount of mouse tissue, such as the tip of a tail. 
DNA was extracted using the following protocol: tissue was digested overnight at 55° 
C in a lysis buffer containing 200 mM NaCl, 100 mM Tris pH8.5, 5 mM EDTA, 0.2% 
SDS and 0.5 mg/ml Proteinase K. The following day, the DNA was extracted once 
5 with phenol/chloroform (50:50), once with chloroform/isoamylalcohol (24:1) and 
precipitated with ethanol. Upon resuspension in TE (lOmM Tris pH7.5, 1 mM EDTA) 
8-10 ug of each DNA sample were digested with a restriction endonuclease, such as 
EcoRI, subjected to gel electrophoresis and transferred to a charged nylon membrane, 
such as HyBondN+ (Amersham, Arlington Heights, IL ). The resulting filter was then 
10 hybridized with a radioactively labelled fragment of DNA deriving from the mouse 
Beer gene locus, and able to recognize both a fragment from the endogenous gene locus 
and a fragment of a different size deriving from the transgene. Founder animals were 
bred to normal non-transgenic mice to generate sufficient numbers of transgenic and 
non-transgenic progeny in which to determine the effects of Beer gene overexpression. 
15 For these studies, animals at various ages (for example, 1 day, 3 weeks, 6 weeks; 4 
months) are subjected to a number of different assays designed to ascertain gross 
skeletal formation, bone mineral density, bone mineral content, osteoclast and 
osteoblast activity, extent of endochondral ossification, cartilage formation, etc. The 
transcriptional activity from the transgene may be determined by extracting RNA from 
20 various tissues, and using an RT-PCR assay which takes advantage of single nucleotide 
polymorphisms between the mouse strain from which the transgene is derived 
(129Sv/J) and the strain of mice used for DNA microinjection [(C57BL5/J x SJL/J)F2], 

Animal Models - Ti 

25 Disruption of the Mouse BEER Gene by Homologous Recombination 

Homologous recombination in embryonic stem (ES) cells can be used to 
inactivate the endogenous mouse Beer gene and subsequently generate animals carrying 
the loss-of-function mutation. A reporter gene, such as the E. coli fi-galactosidase 
gene, was engineered into the targeting vector so that its expression is controlled by the 
30 endogenous Beer gene's promoter and translational initiation signal. In this way, the 
spatial and temporal patterns of Beer gene expression can be determined in animals 
carrying a targeted allele. 

The targeting vector was constructed by first cloning the drug-selectable 
phosphoglycerate kinase (PGK) promoter driven neomycin-resistance gene (neo) 
35 cassette from pGT-N29 (New England Biolabs, Beverly, MA) into the cloninu vector 
pSP72 (Promega, Madson, WI). PCR was used to flank the PGK neo cassette with 
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bacteriophage PI loxP sites, which are recognition sites for the PI Cre recombinase 
(Hoess et al., PNAS USA, 79:3398, 1982). This allows subsequent removal of the neo- 
resistance marker in targeted ES cells or ES cell-derived animals (US Patent 
4,959,317). The PCR primers were comprised of the 34 nucleotide (ntd) loxP 
5 sequence, 15-25 ntd complementary to the 5' and 3' ends of the PGKneo cassette, as 
well as restriction enzyme recognition sites (BamHI in the sense primer and EcoRI in 
the anti-sense primer) for cloning into pSP72. The sequence of the sense primer was 
5’- 

AATCTGGAT CC AT A ACTTCGT AT AGC ATAC ATT AT ACGA AGTT ATCTGC AG 
K) GATTCGAGGGCCCCT-3 1 (SEQ ID NO:34); sequence of the anti-sense primer was 
S’-AATCTGAATTCCACCGGTGTTAATTAAATAACTTCGT 
AT A AT GT AT GCT AT ACGA AGTT AT AG ATCT A GAG TCAGCTTCTGA-3’ (SEQ 
ID NO:35). 

The next step was to clone a 3.6 kb Xhol-Hindlll fragment, containing 
15 the E. co/i fi-gci/actosicJase gene and SV40 polyadenylation signal from pSVP 
(Clontech, Palo Alto, CA) into the pSP72-PGKneo plasmid. The "short arm" of 
homology from the mouse Beer gene locus was generated by amplifying a 2.4 kb 
fragment from the BAC clone 15G5. The 3’ end of the fragment coincided with the 
translational initiation site of the Beer gene, and the anti-sense primer used in the PCR 
20 also included 30 ntd complementary to the 5' end of the fi-galactosidase gene so that its 
coding region could be fused to the Beer initiation site in-frame. The approach taken 
for introducing the "short arm" into the pSP72-(3gal-PGKneo plasmid was to linearize 
the plasmid at a site upstream of the p -gal gene and then to co-transform this fragment 
with the "short arm" PCR product and to select for plasmids in which the PCR product 
25. was integrated by homologous recombination. The sense primer for the "short arm" 
amplification included 30 ntd complementary to the pSP72 vector to allow' for this 
recombination event. The sequence of the sense primer was 5'-ATTTAGGTGACACT 
AT AGA ACTCGAGC AGCT GAAGCTT AACC AC ATGGT GGCTC AC AACC AT-3 ' 
(SEQ ID NO:36) and the sequence of the anti-sense primer was 5'- 
30 AACGACGGCCAGTGAATCCGTA 

ATC ATGGTCAT GCT GCC AGGT GG AGGAGGGC A-3' (SEQ ID NO:37). 

The "long arm" from the Beer gene locus was generated by amplifying a 
6. 1 kb fragment from BAC clone 1 5G5 with primers which also introduce the rare- 
cutting restriction enzyme sites SgrAI, Fsel, Ascl and Pacl. Specifically, the sequence 
35 of the sense primer was 5'-ATTACCACCGGTGACACCCGCTTCCTGACAG-3’ (SEQ 
ID NO:38); the sequence of the anti-sense primer was 5'- 
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ATT ACTT AATTAA AC AT GGCGCGCC AT 

ATGGCCGGCCCCTAATTGCGGCGCATCGTTAATT-3' (SEQ ID NO 39). The 
resulting PCR product was cloned into the TA vector (Invitrogen, Carlsbad, CA ) as an 
intermediate step. 

5 The mouse Beer gene targeting construct also included a second 

selectable marker, the herpes simplex virus I thymidine kinase gene (HS VTK) under the 
control of rous sarcoma virus long terminal repeat element (RSV LTR). Expression of 
this gene renders mammalian cells sensitive (and inviable) to gancyclovir; it is 
therefore a convenient way to select against neomycin-resistant cells in which the 
10 construct has integrated by a non-homologous event (US Patent 5,464,764). The 
RSVLTR-HSVTK cassette was amplified from pPS1337 using primers that allow 
subsequent cloning into the Fsel and AscI sites of the "long arm"-TA vector plasmid. 
For this PCR, the sequence of the sense primer was 5'- 
ATTACGGCCGGCCGCAAAGGAATTCAAGA TCTGA-3' (SEQ ID NO 40); the 
15 sequence of the anti-sense primer was 5'-ATTACGGCGCGCCCCTC 
AC AGGCCGC ACCC AGCT -3 ' (SEQ ID NO:41). 

The final step in the construction of the targeting vector involved 
cloning the 8.8 kb SgrAI-AscI fragment containing the "long arm" and RSVLTR- 
HSVTK gene into the SgrAI and Ascl sites of the pSP72-"short arm"-pgal-PGKneo 
20 plasmid. This targeting vector was linearized by digestion with either Ascl or Pad 
before electroporation into ES cells. 



EXAMPLE 10 

25 Antisense-mediated Beer Inactivation 

17-nucleotide antisense oligonucleotides are prepared in an overlapping 
format, in such a way that the 5' end of the first oligonucleotide overlaps the translation 
initiating AUG of the Beer transcript, and the 5' ends of successive oligonucleotides 
occur in 5 nucleotide increments moving in the 5’ direction (up to 50 nucleotides 
30 away), relative to the Beer AUG. Corresponding control oligonucleotides are designed 
and prepared using equivalent base composition but redistributed in sequence to inhibit 
any significant hybridization to the coding mRNA. Reagent delivery to the test cellular 
system is conducted through cationic lipid delivery (P.L. Feigner, Proc. Natl. Acad 
Sci. USA 84: 7413, 1987). 2 ug of antisense oligonucleotide is added to 100 ul of 
35 reduced scrum media (Opti-MEM 1 reduced serum media, Life Technologies, 
Gaithersburg MD) and this is mixed with Lipofectin reagent (6 ul) (Life Technologies, 
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Gaithersburg MD) in the 100 ul of reduced serum media. These are mixed, allowed to 
complex for 30 minutes at room temperature and the mixture is added to previously 
seeded MC3T3E21 or KS483 cells. These cells are cultured and the mRNA recovered. 
Beer mRNA is monitored using RT-PCR in conjunction with Beer specific primers. In 
5 addition, separate experimental wells are collected and protein levels characterized 
through western blot methods described in Example 4. The cells are harvested, 
resuspended in lysis buffer (50 mM Tris pH 7.5, 20 mM NaCl, ImM EDTA, 1% SDS) 
and the soluble protein collected. This material is applied to 10-20 % gradient 
denaturing SDS PAGE. The separated proteins are transferred to nitrocellulose and 
10 the western blot conducted as above using the antibody reagents described. In parallel, 
the control oligonucleotides are added to identical cultures and experimental operations 
are repeated. Decrease in Beer mRNA or protein levels are considered significant if the 
treatment with the antisense oligonucleotide results in a 50% change in either instance 
compared to the control scrambled oligonucleotide. This methodology enables 
15 selective gene inactivation and subsequent phenotype characterization of the 
mineralized nodules in the tissue culture model. 
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SEQUENCES 

Sequence ID No. 1: Human BEER cDNA (complete coding region plus 5 ' and 3’ 

UTRs) 

5 

.^.'SCCTGTGCTACTfJCVAGGTGGCGTGCCCTCCTCTGGCTGGTACCATGCAGCTCCCACTGGCCCTGTGTCTCGTCTSC 

Cn'GCrTGGTAOACACAGCCTTCCGTGTAGTGGAGGGCCAGGGGTCKiCAGGCGTTCAAGAATGATGCCACGGAAATCATCCC 

cga^ctcgg.^.gtaclnjc^agcctccaccggagctggagaqcaac^gaccats^cc::4gc.cc^:;ag^/j;;gagq.ocggc 

CTCCCCACCACCCCTTTC4A.GA£CAAAGA.CGTGTCCGAGTACAGCTGCCGCGAGCTC : CACTTC\ACCCGCTACGTGACCGAT 
10 GGGC:CGTGC:rGC:AGGGCCAAGCCGGTCACCGAGCTGGTGTGCTCCGGCCAGTGCGGCCCGGCGCGCCTGCT3CCCAACGC 
G\ATCGGCCGCGGCAAGTGG?GGCGACCTAGTGGGCCCGACTTCCGCTGCATCCCCGACC3CTACCGCGCGCAGCGCGTGC; 
.AGCTGCTGT GTCCCGGT GGTG AGGCGCt XiCGCGCGCGCAAGGT GCGCCTGGTGGCCTCGT G CAAGT G CAAGC GCCTCACC 
CGCT TCCAC AACCAGT CGGAGCT CAAGGACTTCGGGACCGAGGCCGCT C GGCCG CAGAAGGG CCGGAAGCC GC3GC CC CG 
CGCCCGGAGCGCCAAAGCCAACCAGGCCCAGCTGGACAACGCCTACTAGAGCCCGCCCGCGCCCCTCCCCACCGGCGGGC 
15 GCCCCGGCCCTGAACCCGCGCCCCACATTTCTGTCCTCTGCGCGTGGTTTGATTGTTTATATTTCATTGT.A-ATGCCTGC 

a-.cccagggcagggggctgagaccttccaggccctgaggaa.tccc:gggcgccggcaaggcccccctcagcccgccagctg 

aggggtcccacggggcaggggagggaattgagagtcacagacactgagccacgcagctccgcctctggggccgcctacct 

TTGCTGGTCCCACTTCAGAGGAGGCAG.AAATGGAAGCATTTTCACCGCCCTGGGGTTTT.AAGGGAGCGGTGTGGGAGTGG 
GAAAGTCCAGGGACTGGTTAAGA.AAGTTGGAT.AAGATTCCCCCTTGCACCTCGCTGCCCATCAGAAAGCCTGAGGCGTGC 
20 CCAGAGCACAAGACTGGGGGCAACTGTAGATC-TGGTTTCTAGTCCTGGCTCTGCCACTAACTTGCTGTGTAACCTTGAAC 
T AC : AC : AAT T C TCCT TCGG G AC CT C AAT T T CCA.CTT ?G T AAAAT G AGGG TGGAGGTGG G AAT AC G AT CT C G AGG.AGACT AT 
T GGC AT AT GATT C C AAGG ACT CCA.GT GCCTT T TG AAT GGGCAdAG GTG AG AG AG A.G AG.AG AGAA.AGAGAGAGAATGAAT G 
CAG7TGCAT TGATTCAGT GCC AAGGTC AC TTC C AGAATT CAGAGTTGT GATGCT CTCTTCTG ACAGCCAAAG AT 3AAAAA 
CA.AACAGAAAAAA a AAAGTAAAGAGTCTATTTATGGCTGACATATTTA.CGGCTGACAAACTCCTGGAAGAAGCTATGCTG 
2 5 CTTCCCAGCCTGGCTTCCCCGGATGTTTGGCTACCTCCACCCCTCCAT CTCAAAGAAATAACAT CATCCATT GG3GTAGA 

AAAGGAGAGCGTCCGAGGGTGGTGGGAGGGATAGAAATCACATCCGCCCCAACTTCCCAAAGAGCAGCATCCCTCCCCCG 
ACCCATAGCCATGTTTTAAAGTCACCTTCCGAAGAGAAGTGAAAGGTTCAAGGACACTGGCCTTGCA.GGCCCGAGGGAGC 
AGCCATCACAA.ACT CACAGACCAGCACATCCCTTTTGAGACACCGGCT TCTGCCCACCACT CAC'GGACACAT 7T CTGCCT 
AGAAAAC AG C7T CT T AC T GCTCT T AC ATGTGAT GG CAT AT CT T ACA.CT AAAAG AAT ATT AT TGGGGG AAAAACT ACAAGT 
30 GCTGTACATATGCTGAGAAACTGCAGAGCATAA/TAGCTGCCACCCAAAAATCTTTTTGAAAA/rCATTTCCAGAC.AACCTC 
TT ACTTTCT GTGT AGTTT TTAAT TGTTAAAAAAAAAAAGTTTTAAACAGAAGCACAT GACAT ATGAAAGCCTCCAGGACT 
GGTCGTTTT TTT GG CAA.T TCTTCCACGTGGGACTT GT CCACAAG AATG AAAGT AGTGGTTTTT AAAGAGTTAA GTT ACAT 
ATTTATTTTCTCAC TT AAGTTAT TTA7GCAAAAGT TT TTCTTGTAGAGAATGACAATGTTAAT ATTGCTTT A7GAATTAA 
CAG TCTGTTCTT C C AG AG T CC AG AG AC AT TGT T AAT AAAG AC AATG AAT CAT GACCGAAAG 
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Sequence ID No. 2: Human BEER protein (complete sequence) 

MQL F LALC LV CL LVH T A FR WEGQGWQAFKN DAT El I F E L3EY FE FFFELEN NKTMNRAENGGRF F'HHFFET K DVS E Y SC 
RELH FTRYVT Do F CR 3AK F VTELVCSGQCG PAR L LPNA7 GRGKWWRFSG FDFRCI F DR Y RAQRVQL LC FGGEA ? RARKVR 

5 lvasckckrltrfhnqselkdfgteaarpqkgrkprprarsakanqaelenay 



Sequence ID No. 3: Human Beer cDNA containing Sclerosteosis nonsense 

1 0 mutation 

AGAGCCTGTGCTACTGa-AGGTGGCGTGCCCTCCTCTGGCTGGTACCATGCAGCTCCCACTGGCCCTGTGTCTCGTCTGC 

CtgcTGGTACACACAGCCTTCCGTGTAGTGGAGGGCTAGGGGTGGCAGGCGTTCAAG.AATGATGCCACGGAAATCATCC'j 

CGAGCTCGGAGAGTACCCCGAGCCTCCACCGGAGCTGGAGARC.AACAAGACCATGAACXGGGCGGAGAAOGGAGGGCGGC 

1 5 ctccccAccacccctttgagaccaaagacgtgtccgagtacagctgccgcgagctccacttcacccgctacgtgaccgat 

GGGCCGTGCCCCAGCGCCAAGCCGGTCACCGAGCTGGTGTGCTCCGGCCAGTGCGGCCCGGCGCGCCTGCTGCCCAACGC 

catcggccgcggcaa.gtggtggcgacctagtgggcccgacttccgctgcatccccgaccgctaccgcgcgcagcgcgtgc 

AGCTGCTGTGTCCCGGTGGTGA.GGCGCCGCGCGCGCGCAA.GGTGCGCCTGGTGGCCTCGTGCAAGTGCAA.GCGCC'i’CA.CC 
CGCTTCCACAACCAGTCGGAGCTCAAGGACTTCGGGACCGAGGCCCX’TCGGCCGCAGAAGGGCCGGAAGCCGCGGCCCCG 
20 CGCCCGGAGCGCCAAAGCCAA.CCAGGGCGAGCTGGAGAACGCCTACTAGAGCCCGCCCGCGCCCCTCCCCACCGGCGGGC 
GCCCCGGCCCTGAA.CCCGCGCCCCACATTTCTGTCCTCTOCGCGTGGTTTGATTGTTTATATTTCATTGTAAA.TGCCTGC 
AACCCAGGGCAG3GGGCTGAGACCTTCCA3GCCCTGAGGAATCCCGGGCGCCGGCAAGGCCCCCCTCAGCCCGCCAGCTG 

AGGGGTCCCA.C G 3GGC A.G GG G AGGG AAT T G AGAG T CACAG AC ACT G AGCC A CGCAGCCCCGCCT CT GG GGCC G CCT AC C T 

TTGCTGGTCCCACTTCAGAGGAGGCAGAAAT-3GA.AGCATTTTCACCGCCCT3GGGTTTTAA.GGGAGCGGTGTGGGAGTGG 
25 G AAAGT CCAG GGACT GGTTAAGAAAGTT GSA.TAAGATTCCCCCTTGCACCVCGCTGCCCATCAGAAAGCCTGAGGCGT GC 

CCAGAGCACAAGACTGGGGGCAACTGTAGATGTGGTTTCTAGTCCTGGCTCTGCCACTAACTTGCTGTGTAACCTTGAAC 

TACACAATTCTCCTTCGGGACCTCAATTTCCACTTT5TAAAATGAGGGTGGAGG7GSC-AATAGGATCTCGAGGAGACTAT 

TGGCATATGATTCCMGGACTCCAGTGCCTTTTGAATC^CAGAGGTGAGAGAGAGAGAGAGAAAGAGAGAGAATGAATG 

CAGTTGCATTGATTCAGTGCCAAGGTCACTTCCAGAATTCAGAGTTGTGATGCTCTCTTCTGACAGCCAAA3ATGAAAAA 

3 0 CAAACAG AAAAAAAAAAGT. AAAGAGT CT ATT T AT GG CT GAC AT AT TTACGGCTGACAAAC T CCTGGAAGAAGCT AT GC T G 

CTTCCCAGCCTGGCTTCCCCGGATGTTTGGCTACCTCCACCCCTCCATCTC.AAAG.AAATAACATCATCCATTGGGGTAGA 

AAAGGAGAGGGT CC G.AGGGT GGTGGG AGGGAT AGAAATC ACATCC G CCC CAACTTCCCAAAGAGCAGCATCCCT CCCCCG 
ACCC AT AGCC AT GT TT T AAAGT CACCTTCCG.AAG AG AAGT G AAAGGTTC AAGG ACACTGGCCTT G C AGG CC CG AGGGAG C 

AGCCATCACAAACTCACAGACCAGCACATCCCTTTTGAGACACCGCCTTCTGCCCACCACTCACGGACACATTTCTGCCT 

AGAAAACAGCTT CTT ACTGCT CTTACATGTG ATGGCAT AT CTT ACACTAAAAGAATATT ATTGGGGGA AAA.ACT AC.AAGT 

GCTGTACATATGCTGAGAAACTGCAGAGCATAATAGCTGCCACCCAAAAATCTTTTTGAAAATCATTTCCAGACAACCTC 
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TT^:TTTf:S"::tVGTAC^TTTT^TTGTT.WAW^AAGTTTT.-A^CAG^.GCACATC-ACATAT^^AGCCTGCAGC*AC: 
GC T CGTTTTTTTGGCAA.TTCTTCCACGTGGGACTTGT CJCACAAG »ATGA AAGTAGT GG T T TT T AAAG AG T TAAGT T.-.C A T 

ATTTATTTTCTCACTTAAGTTATTTATGCAA-AGTTTTTCTTGTAGAGAATGACAATGTTAATATTGCTTTATGAATTAA 

CAGTCT GTT CTTC'CAG A.GTCCA.G AGACA.TTGT TA.ATAAAGAlCAAT GA.A.T CA.T GA.CC GAAAG 

5 

Sequence ID No. 4: Truncated Human Beer protein from Sclerosteosis 

MQL PLA.LCLVC LLVKTAFRVVEG* 

10 

Sequence ID No. 5: Human BEER cDNA encoding protein variant (V10I) 

15 AGAGCCTGTC4CTAC7GGAAGGTGGCGTGCCCTCCTCTGGCTGGTACCATGCAGCTCCCACTGGCCC7GTGTCTCATCTGC 

CTuCTGG : ...C.-.^ACAGCCTTCCGT,,T AGTGG.AGGGCCAGGGGT GGCA.GGCG TTC.AAGAA.TGATGCCACGGAAAT( ‘AVCCG 

CGAGCTCGGAGAGTACCCCGAGCCTCCACCGGACvCTGGAGAACAAC.AAGACCATGAACCGGGCGGAGAACGGAGGGCGGC 
CTCCuCACCACCCCTTTGAGACCAAAGACGTGTCCGAGTACAGCTGCCGCGAGCTGCACTTCACCCGCTACGTGACCGAT 
GvjuCCGTGCCwCAGCGCCAAGCCGGTCACCGAGCTGGTGTGCTCCGGCCAGTGCGGCCCGGCGCGCCTGCTGCCCA.ACGC 
20 CATCGGCCGt-^‘-tC l AAGTGGTGGCGA.CCTAGTGGGCCCG.ACTTCCGC7CC.ATCCCCGACCGC7ACCGCGCGCA.GCGCGTGC 
AGCTGCTGTGTCCCGGTGGTGA.GGCGCCGCGCGCGCGCAAGGTGCGCCTGGTGGCCTCGTGCAAGTGCAAGCGCCTCP.CC 

cgcttccac.aaccagtcggagc:tcaaggacttcgggaccgaggccgctcggccgcagaagggccggaagccgcggccccg 

CGCCCGGAGCGCCAAAGCC.AA.CCAG3CCGAGCTGGAGAACGCCTACTAGAGCCCGCCCGCGCCCCTCCCCACCGGCGGGC 
. GCCCOj , oCv~CTGAACCCGCGCCCC.ACATTTCTG7CCTC7GCGCGTGGTTTGA.T7G7TTATATTTCATTGTA AAT3C r ' T GC 

25 aacccagggcac-ggggctgagaccttccaggccctgaggaatcccgggcgccggcaaggcccccctcagcccgccagctg 
AGGGGTCCCACGGGGCAGGGGAGGGAATTG.AGAGTCACAGACACTGAGCCACGCAGCCCCGCCTCTGGGGCCGCCTACCT 
TTGCTGGTCCCACTT-CA.GAGGAGGCAG.AAATGGAAGCATTTTCACCGCCCTGGGGTTTTAAGGGAGCGGTGTGGGAGTGG 

GAAAGTC C AGGGACT GGT T AAG.AAAGTTGG AT AAG ATT CCCCCT T GCACCTCG CTGCCC ATC AG AAAG CCTG AGGCG T GC 

CCAGA.GCACAAGACTGGGGGCAACTGTAGATGTGGTTTCTAGTCCTGGCTCTGCCACTAACTTGCTGTGTAACCTTGAAC 
.30' TACACAATTCTCCTTCGGGACCTCAATTTCCACTTTGTAAAATGAGGGTGGAGGTCCCAATAGGATCTCGAGGAGACT AT 
TGGCATATGATTCCAAGGACTCCAGT3CCTTTTGA.ATGGGCAGA.GGTGAGAGAGAGAGAGAGAAAGAGAGAGAATGAATG 

cagttggattgattcagtgccaaggtcacttccagaattcagagttgtga.tgCtctcttctgacagccaaagatgaaaaa 

CAAACAGAAAAAAAAAAGTAAAGAGTCTATTTATGGCTGAGATATTTACGGCTGACAAACTCCTGGAAGAAGCTATGCTS 
CTTCCCAGCCT’jGC . TuCCCGGATGTTTGGCTACCTCGACCCCTCGATC'TCAAAGAAATAAGATCATCGATTGGGGT AG" 

35 ; aaaggagagggtccgacc-gtgctgggagggatagaaatcacatccgccccaacttcgcaaagagcagcatccctcccccg 
ACCCATAGCCATGTTTTAAAGTCACCTTCCGAAGAGAAGTGAMGGTTCAAGGA.CA.CTGGCCTTGCA3GCCCGAG3GAGC 
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A.3CCATCAC.AAAC7CA.C.AGACCAGCACATCCCTTTTGA.G.ACAC.CGCCTTCTGCCCACCA.CTCACGGACACA.T7TCTGCC7 
.AGAAAACAGCTTCTTACTGCTCTTACATGTGATGGCATATCTTACACTAAAA.G.AATATTATTvKSGGGAAAAACTACAA.GT 
■3CT G T ACA.T A TGC7 G AGAAACT GC AG AG CAT AA’ I ‘ AGCTGCGAC CC AAAAA.T CT T TTT G AAAATCATT T C C AG AC AACCTC 
T TACTTTCTGT GTAG7TTT TAATTGTT.AWAAAAAAAGTTTT AAACAGAA.GCACAT G ACAT ATGA.AAGCCT GC AG G ACT 
5 GGTCGTTTTTTTGGCAATTCTTCCACGTGGGACTTGTCC.ACAAGAATGAAAGTAGTGGTTTTTAAAGAGTTAAGTTAC.AT 
ATTT A7TTT CTCACT TAAGTTATT TATGCAAAAGT TT TT CTTGT AGAGAATC-ACAATG7T AAT AT TGCTT TATGAATTAA 
CAGTCTGTTCTTCCAGAGTCCAGAGACATTGTTAmATAAAGAC.:AATG.AATCA7GACCGAAAG 

10 Sequence ID Wo. 6: Human BEER protein variant (V10I) 

— QLf LALCLICLLV'HTAFRWEGQGWQAFKNDATE I IRE LGEY FEF F FELENNKTMNRAEHGGRr fHH F FETKDVSEYSC 
RZ LH FT RYV? D3FCR 5AKFVTE LVCSGQCG EAF.LL FNAI GRGKWWR FSG F Dt'KCI PDRYR AQRVQLLC PGGEA FRA.RKVR 
LVASCKCKR LT R FH M QSE LKDFGT E AAR FQKGRK F RFRAR SAKAM QAE LENAY 

15 

Sequence ID No. 7: Human Beer cDNA encoding protein variant (P38R) 

20 agagcctgtgctactggaaggtggcgtgccctcctctggctggtaccatgcagctcccactggocctgtgtctcgtctgc 
ctgctggtacacacagccttccgtgtactggagggccaggggtggcaggcgttcaagaa.tgatgccacggaaatcatccg 

C GAG CT CGG.AGA.G T ACCC CGA5CCT CCACCGG AGCT GGAG.AACAACAAGAC CATG AACCGGGCGG AGA-ACGGAGGGCGGC 
C T CC CC A.CC .AC CC C T T 7GAGA.CCAAA.3A.CGT GT C C G A.GT A.CA.GCTG CC GCG.AG CT G CACT T CA.C CCGCT A.CGT G.ACCG AT 
GGGCCGTGCCGCAGCGCCAAGCCGGTCACCGAGC7GGTGTGCTCCGGCCA.GTGCGGCCCGGCGCGCCTGCTGCCCAA.CGC 
25 CATCGGCCGCGGCAAGTGGTGGCGACCTAvGTGGGCCCGACTTCCGCTGC.ATCCCCGACCGCTACCGCGCGCAGCGCGTGC 
A.GCTGCTGTGTCCCGGTGGTGA.GGCGCCGCGCGCGCGCAAGGTGCGCCTGGTGGCCTCGTGCAA.GTGCAA.GCGCCTCACC 
CGCTTCCACAACCAGTCGGAGCTCAAGGA.CTTCGGGA.CCGA.GGCCGCTCGGCCGCAGAAGGGCCGGAAGCCGCGGCCCCG 
CGCCCGG.AGCGCCAAA.GCCAACCAGGCCGAGCTGGAGAACGCCTACTAGAGCCCGCCCGCGCCCCTCCCCACCGGCGGGC 
GCCCCGGCCCTGAA.CCCGCGCCCCACA.TTTCTGTCCTCTGCGCGTGGTTTGA.TTGTTTATATTTCATTGTAAATGCCTGC 
30 AACCCAGGGCAGGGGGCTGAGACCTTCCAGGCCCTGAGGAATCCCGGGCGCCGGCAAGGCCCCCCTCAGCCCGCCAGCTG 
A.GSGGTCCCACGoGGCAGGGGAGGGAATTGAGAGTCA.CAGA.CACTGA.GCCACGCAGCCCCGCCTCTGGGGCCGCCTACCT 
TTGCTGGTCCCACTTCAGAGGAGGCAGAAA.TG3AAGCATTTTCACCGCCCTGGGGTTTTAAGGGAGCGGTGTGGGA.GTGG 
GAAA.GTCCAGGGACTGGTTAAGAAAGTTGGATAA.GATTCCCCCTTGCACCTCGCTGCCCA.TCAGAAA.GCCT3AGGC3TGC 
‘CCAGAGC.ACAAGACTGGGGGCAACTGTAGATGTGGTTTCTAGTCCTGGCTCTGCCACTAA.CTTGCTGTGTAACCTTGAAC 
35 T ACACAATT CTCCTTCGGG ACCTCAATTT CCACT TTGTAAAATGAGGGTGG AGGTGGGAATAGGAT CTCGAGGAGACT A.T 

TG3CATATGATTCCAAGGACTCCAGTGCCTTTTG.AA.TGGGCAGA.GGTGA.GAGAGAGAGAGAGAAAGAGAGAGAA7G.AATG 




. WO 00/32773 



PCT/US99/27990 



72 

CAGTTGCATTGATTCAGTGCCA7v3GTCACTTCCAG. a - a .TTCAGA3TTGTGATGCTCTCTTCTCiACAGCCAAAC.iA’JTic fc cL w a.o, 
CAAACAGAAAAAFAAAAG7AAAG.A3?CTATTTATG'iCrGACATATTT.ACGGCTGACAAACTCCTGG.AAGAAGCTATGCTG 
CTT C C CAGCCTGG CT T CC C CGC- AT G T T T G C CT ACCTCC ACCCC TCCAT CTC AAAGAAAT AACATC ATCC AT T GGGGT A GA 
AAAG GAG AG G GT C C G AG G G T GG T GGG.AGGGATAG.AAATCACATCCGCCCCAACTTCCCAAAGAGCAGCATCCCTCCCCCG 
5 ACC CAT AGC C AT G T T TT AAAGT CACCT T CCGAAG AGAAG7 GAAAGGT T CAAGG ACACT GGCCT TGC AGGCCCG AGGGAGC 

AGCCATCACAAACTCACAGACCAGCACATCCCTTTTGAGACACCGCCTTCTGCCCACCACTCACGGACACATTTCTGCCT 
A GAAAA J A G C T T C T T AC T GCTCTT.-.C.AT GT GA.TGGCA.TATCTT.AC.ACT .AAAAG,AATATTATTGGGGGAAAAACT Q C*, a n GT 

gctgtacatatgctgagaaactgcagagcataatagctgccacccaaaaatctttttgaaaatcatttccagacaacctc 

TTACTTTCTGTGTAGTTTTTAATTGTTAAAWAAAAAGTTTTAAACAGAAGCACATGACATAT5AAA5CCTGCAGGACT 
10 • G3TCGTTTTTTTGGCAATTCTTCCACGTGGGACTTGTCCACA.AGAATGAAAGTAGTGGTTTTTAAAGAGTTAAGTTACAT 
ATTTATTTTCTCACTTAAGTTATT7ATGCAAAAGTTTTTCTTGTAGAGAATGACAATGTTA.ATATTGCTTTATGAATTAA 

cagtctgttcttccagagtccagagacattgttaataaagacaatgaatcatgaccgaaag 

15 Sequence ID No. 8: Human Beer protein variant (P38R) 

MQLF LALC LVCLLVHT AFR WFGQGWQA FKN DAT Z I IRE LGEY PE FFF E LENNKTMN RAENGG R F FHH F FETK DYSEY3C 
RELH FTRYVTDGFCRSAKFVTELVC3GQCGF ARLLFM.AIGRGKWWR FSG FDFRCI F DRYRAQRVQI.I.C FGG1CAF R.ARKVR 
L>V AS CKCKRL7RFHN QSELK D FGT E AAR PQKGRKr R PR AR S AXANQAE LE HAY 

20 

Sequence ID No . 9 : Verve t BEER cDNA (complete coding region) 

25 atgcagctcccactggccctgtgtcttgtctgcctgctggtacacgcagccttccgtgtagtggagggccaggggtggca 

G GCCTT CAAG.AATGA? GCCAC G 3AAAT CAT C CCC GAGCT CG C AG AGT AC CC CG AGCCTCC ACCG G AGCT GGAG AAC AAC A 
AGACCAT G.AACCGGGCGGAGAATGGAGGGCGGCCT CCCCACCACCCCTT TGA.G ACCA.AAGA.CGTGTCCGAG T ACAGCTGC 
CGAGAGCTGC.ACTTCACCCGCTACGTGACCGA.tGGGCCGTGCCCCAGCGCCAAGCCAGTCACCGAGTTGGTGTGCTCCGG 
CCAGTGCGGCCCGGCACGCCTGCTGCCCAACGCCATCGGCCGCGGCAAGTGGTGGCGCCCGAGTGGGCCCGACTTCCGCT 
30 GCATCCCCGA.CCGCT.ACCGCGCGC.AGCGTGTGCAGCTGCTGTGTCCCGGTGGTGCCGCGCCGCGCGCGCGCAAGGTGCGC 
CTGGTG GCCT CGTGC AAGT GC AAGCGCCT CACCCGCTTCC.ACAACC.AGTCGGAGCTCAAGG ACTTCGGTCC CGAGGCCGC 
TCGGCCGCAGAAGGGCCGGAAGCCGCGGCCCCGCGCCCGGGGGGCCAAAGCCAATCAGGCCGAGCTGGAGAACGCCTACT 

AG 

35 

Sequence ID No. 10: Vervet BRER protein (complete sequence) 
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MQL t LA LCL V C L L V H AA FK V V EGQG WQA FKN DAT El I EELGEYEEF P? E 1ENNKTMNR.AE- JGGR F PHI! F F ETKDVSEYSC 
RELI ! FTRYVT DGt*CRSAKPVTELVGS5QCGf'ARLLFN.AIGRGKWWRrS3 FDFRCI PDRYRAQRVQLLC PGGAA FRAKKVR 
LVASCKCKR LTRFHNQSF. f .KDFGPEAAR FQKGRK PR PRAR GAKAN QAE LENAY 

5 

Sequence ID No. 11: Mouse BEER cDNA (coding region only) 

10 AT GC AG CCC T C ACT AGCCC C G T GCC T CAT CT GCC T ACTT G YG CAC G CT G CCTT C? G TGC7 GTGG AGGG CC AGG C CTG GCA 

AGCCTTCAGGAATGATGCCACAGACCTCATCCCAGGGCTTGCvAGAGTACCCCC-AGCCTCCTCCTGAGAACAACCAGACCA 

tgaaccgggcggag.aatggaggcagacctccccaccatccctAtgacgccaaaggtgtgtccgagtacagctgccgcgag 

CTGCACTAC AC CCG CT TCCTG ACAGACGGCCCAT GC CGCAGCGCCA.AGCCGGTCACCG AGT TC-GT G T G C T C CG GCC AG T G 
CGGCCCCGCG CGGCT G CT GCCCAACGCCATCGGGCGCGT GAAGTGG7GG CGCCCG AACGG ACCGGATTT CCGCTGCATCC 

1 5 CGGATCGCT.ACC3CGCGCAGCGGGTGCAGCTGCTGTGCCCCGGGGCCGCGGCGCCGCGCTCGCGCAAGGTGCGTCTGGTG 

GCCTCGTGC.AAGTGCAAGCGCCTCACCCGCTTCCACPACCAGTCGGAGCTCAAGC^ACITCGGGCCGGAGACCGCGCGGCC 
GCA.GAAGGGTCGCAA.GCCGCGGCCCGGCGCCCGGGGP.GCCAAAGCCAACC.AGGCGGA.GCTGGAGA-ACGCCTACT.AGAG 

20 Sequence ID No, 12: Mouse BEER protein (complete sequence) 

MQPS LAFCLI CLLVHAAFC.AVEGQGWQAFRND.ATEVI FGL.GE Y FE Pr r ENNQTMNRAENGGR r EH H F Y D.AKDVS EYSCRE 
LHYTRFLTDGPCRSAK?\n'EI.YC:SGQCGFARLLFNALGRVK\VWRPNGFDFRCIEDRYRAQRVQLLCFGGA.AFRSRKVRLV 
ASCKCKR LTR FHMQSELKDFG PETAR FQKGRK PR E GARGAKAN QA E LENAY 

25 

Sequence ID No . 13 : Rat BEER cDNA (complete coding region plus 5 ’ UTR) 

30 GAGGACCGAGTGCCCTTCCTCCTTCTGGCACCATGCAGCTCTCACTAGCCCCTTGCCTTGCCTGCCTGCTTGTACATGCA 

G C CTT CG T TG CTGTGG AGAGCC AGG GG TGGCAAG C CT T C.AAG AAT G ATG C C ACAGAAAT CATCCCGGG ACT C AGAG AG T A 
CCCAGAGCCTCCTCAGGAACTAGAGAAC.AACCAGACCATGAACCGGGCCGAGAACGGAGGCAGACCCCCCCACCATCCTT 

atgacaccaaagacgtgtccgagtacagctgccgcgagctgcactacacccgcttcgtgaccgacggcccgtgccgcagt 

gccaagccggtcaccgagttggtgtgctccggccagtgcggccccgcgcggctgctgcccaacgccatcgggcgcgtgaa 

35 GTGGTGGCGCCCGAACGGACCCGAC^TCCGCTGCATCCCGGATCGCTACCGC'GCGCAGCGGGTGCAGCTGCTGTGCCCCG 
GCGGCGCGGCGCCGCGCTCGCGCAAGGTGCGTCTGGTGGCCTCGTGCAAGTGCAAGC3CCTCACCCGCTTCCACAACCAG 
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TCGGAGCTC^GGAC7TCC-GACCTGACACCGCGCGGCCGCAG.^GGTCacPAC^ r JCoCGGCCCCGC‘oCCCG3Gv.-..”^*:.-.A 

AGCCAACCAGGCGGAGCTGGAGAACGCCTACTAG 

5 Sequence ID No. 14: Rat BEER protein (complete sequence) 

1 ’ Q LS L A PC L>. C L LVK .AA F " \J A V Z SQG W QA F KM DAT Eli PGLREYPE F PQE LEMNQ7MHRAENGGR F F HH F Y DTKDYGEYG'J 
RE LH YT R FVT DG FCRSAKFVT E L7CSGQCG F ARL L F M AT GRVKWWR PNG F DFRC I F DRYRAQR VQ L LCFGGAAF A SR KVA 
LVASCKCKRLT R FHNQSELKD FGFETAR FQKGRK F R F RARGAKAM QAE LEN AY 

10 



Sequence ID No. 15: Bovine BEER cDNA (partial coding sequence) 

15 agaatgatgccacAgaaaitjatccccgagctgggcgagtaccccga.gcctctgccacagctgaacaacaagaccatgaac 

CGGGCGGAGAACGCAGGGAGACCTCCCCACCACCCCTTTGAGACCAAAGACGCCTCCGAGTACAGCTGCCGGGAGCTGCA 
C7TCACCCGCTACGTGACCGATGGGCCGTGCCGCAGCGCCAAGCCGGTCACCGAGCTGGTGTGCTCGGGCCAGTGCGGCC 
CGGCGCGCCTGCTGCCC.AACGCCATCGGCCGCGGCAAGTGGTGGCGCCCAAGCGGGCCCGACTTCCGCTGCATGCCCGAC 
CGCTACCGCGCGCAGCGGGTGCAGCTGTTGTGTGCTGGCGGCGCGGCGCCGCGCGCGCGCAAGGTGCGCCTGCTGGCCTC 
20 G7GCAAG7GCAAGCGCCTCACTCGCTTCCACAACCAGTCCGAGCTCAAGGACTTCGGGCCCGAGGCCGCGCGGCOG7AAF. 
CGGGCCGGAAGCTGCGGCCCCGCGCCCGGGGCACCAAAGCCAGCCGGGCCGA 

Sequence ID No. 16: Bovine BEER protein (partial sequence -- missing 

25 signal sequence and last 6 residues) 

N DAT E 1 1 p E LGE Y r E E L F E LNNKTMNRAENGG R P PH H P FET K DAS F.Y SCRE 1H FTRYVT DG PCRSAK PVTE L VCSGQC5? 
AkLLFNAIGRGKWWRPSG PDFRCI PDRYRAQRVQLLCPGGAAFRARKVRLVASCKCKRLTRFHNQSE LKDFGPE.AARF QT 
GRK LRPRARGTKAS AA 

30 



Sequence ID No. 17: Mlul - AvlII restriction fragment used to make mouse Beer 

transgene 

35 

CGCGTTTTGGTGAGCAGCAATATTGCGCTTCGATGAGCCTTGGCCTTGAGATTGATACCTCTGC7GCACAAAAGGCAATC 
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GACCX^C^r«GACrAGCGCATTCGTG^ACCGTCTCCTTC5AACTTATTC5aaATGGWGTGTCJft.TTCATCOAGGACNGCC 
TGATCGCAAATGGTGCTATCCA»CGCAGCGGCAATCGAAAACCCTCAGCCGGTGAC:CAATATCTACAACATCAGCCTTGGT 
AT CCT nOGTGATGAGt 'CAGCGCAGAACAAGGTAACCGTC.AGTGCCG.ATAAGTTCAAAGTT.AAACCTGGTGTTGAiTACCAA 
C.ATTGAAACCTTG.AT CGA.AAACGCGCT GAAAAACGCT GCT G AAT G T GCGG CGCTGG ATGTC ACAAAGC AAATGGC AGCAG 
5 ACAAGAAAGCGATC^ATGAACTGGCTTCCTATGTCCGCACGGCCATC.ATG.ATGGAATGTTTCCCCGGTGGTGTTATCTGG 

CAGCAGTGCCGTC3AT ACT ATG CAATTGAT AATTATT ATCAT TTG CGGGTCCTTTCCGGCGATCCGCCT TGTT ACGGGGC 
GGCG ACCT CGCGG3T TT TCGCT ATTT AT5AAAATTT7 CCGGTTT AAGGCG ?T TCCGTTC77CTTCGTCAT AACTTAATGT 

TTTTATTTAAAATACCCTCTG.AAAA.GAAAGGA.AACGACAGGTGCTGAAAGCGAGCTTTTTGGCCTCl'G'rCGTTTCCTTTC 
TCTGTTTTTGTCCGTGGAATG.ALCAATGG.AAGTCAACAAA-AGCAGAGCTTATCGATGATAAGCGGTCAAACATGAGAAT 
10 TC^C^oCCGCATAATACGACTCACTATAGGGATCGACGCCTACTCCCCGCGCATGAAGCGGAGGAGCTGGACTCCGCATG 
CCCAGAGACGCCCCCCAACCCCCAAAGTGCCTGACCTCAGCCTCTACCAGCTCTGGCTTGCGCTTGG3CGGGGTCAAGGC 
TACCACGTTCTCTTAACAGGTGGCTCGGCTGTCTCTTGSCCGCGCG7CATG7GACAGCTGCCTAGTTCTGCAGTGA.GGTC 
A.CCGTvjGAATGTCTGCCTTCGTTGCCATGGCAACGGGATGACGTTA.CAATCTGGGTGTGGAGCTTTTCCTGTCCGTGTCA 

GGAAATCCAAAT ACCCT AAAATACCCT AG AAGAGGAAGTAGC TGAG CCAAGGC7T TCCTGGCTTCT CCAGAT AAAGTTTG 

15 AC7TAGA7GGAAAAAAACAAAATGATAAAGACCCGAGCCATC:TGAAA.ATTCCTGCTA.ATTGCACC.ACTAGGAAATGTGTA 

T ATT ATT GAGCTCGTATGTGTTCTTAT7T TAAAAAG.AAAACT TTAGTCATG7T AT TAATAAGAAT T TCTC AGCAGTGGGA 
GAGAACCAATAT 7 AACACCAAGAT AAAAG T 7GG CAT GAT CCACATT GCAGGAAGATCCACGT TGGGTTTT CAT GAAT GT G 
AAG ACCCCATTT AT T AAAGT CCT AAGCT CTGT T T TTGCACAC T AGG AAGCG AT GG CCG 3G AT GGCTG A.GGGGCTGT AAGG 
AT CTTTC AATGT CT T A.CAT GT GTGTTT CCTGT CCTGCACCT A.GGACCTGCT GCCT AGC CT GC AGC AGAGCC AGAGGGGTT 
20 TCACATuATTAGTCTCAGACACTTGGGGGCAGGTTGCATGTACTGCATCGCTTATTTCCATACGGAGCACCTACTATGTG 
TCAAACACCA/TATCGTGTTCACTCTTCAGAACGGTGGTGGTCA/rCATGGTGCATTTGCTGACGGTTGGATTGGTGGTAGA 
GAGCTGAGATATATGGACGCAC.TCTTCAGCATTCTGTCAACGTGGCTGTGCATTCTTGCTCCTGAGCAAGTGGCTAAACA 

GA.CT CAC AG GGT CAGCCT C CAGC T C AGT CG CT GCATAGT CT TAGGGAAC C7 CT CCCAGTCCT CCC T ACCT CAACT AT CCA 

AGAAGCCAGGGGGCTTGGCGGTCTCAGGAGCCTGCTTGCTGGGGGACAGGTTGTTG.AGTTTT.ATCTGC.AGTAGGTTGCCT 
25 AGGCATAGTGTCAGGACTGATGGCTGCCTTGG.AGAACACATCCTTTGCCCTC7ATGCAAATCTGACCTTGACATGGGG3C 
GCTGCTCAGCTGGGAGGATCAAC7GCATACCTAAASCCAAGCCTAAAGCTTCTTCGTCC.ACC7GAAACTCCTGGACC,AAG 
GGGCTTCCGGCACATCCTCTCAGGCCAGTGAGGGAGTC7GTGTGAGCTGCACTTTCCAATCTCAGGGCGTGAGAGGCAGA 
GGGAGGTGGGGGCAGAGCCTTGCAGCTCTTl’CCTCCCATCTGGACAGCGCTCTGGCTCAGCAGCCCATATGAGCACAGGC 
ACATCCCCACCCCACCCCCACCTTTCCTGTCCTGCAGAA.TTTAGGCTCTGTTCACGGGGGGGGGGGGGGGGGGGCAGTCC 

30 T AT C CT CTCT T AGGT AC-AC AGG ACT CT GCAGoAG ACACT G CT T TGTAAGAT ACT GC AGT TT AAATT T GG ATG T T GT GAG G 

GGAAAGCGAAGGG CCT CTT T G ACC ATT C AG T CAAGGT ACCTT CTAACT C C C ATCGT ATT GGGGGGCT AC T CT AG TG CT AG 
ACATT GCAG AG AG CCTCAG AACT GT AGTT ACCAGT GT GGT AGG ATT GAT CCT TC AGGG AGCCT G AC ATG TGAC AGT TC CA 
TT CT T CAC C C AGT CACCG AACATT T AT T C AGT AC CT ACCCCGT AAC AGGCACCGT AG C AGGT ACTG AGGG AC G G ACCACT 
C AAA.G AACT G ACAGACCG AA.G CCT T GG AAT A! AAAC ACC AAAG C ATCAGGCT CTGCCAACA.G AACACT CTT T AACA.CT CA 
3 5 GGCCCTTT AA CACTC AGG A C CCCC ACCCC CACC CC AAG CAGTTGGC ACTGCT ATCC ACATTTT ACAG AG AG G A AAAACTA. 

GGCA^CAGGACG.AT.AT AAGTGGCTTGCTT AAGCTTGTCTGCATGGT AAATGGCAGGGCTGGATTGAGACCCAGACATTCCA 




WO 00/32773 



PCT/US99/27990 



76 



ACTCT AGGGT C7ATTTT7 CTTTTT TCTCGTT3T TCG AAT CTGGG TC7T ACTG GGTAAACT CAGG CTAGCCTCAC ACTCA7 
ATCCTTCTCCCATGGCTTACGAGTGCTAGCATTCCAGGTGTGTGCTACCATGTCT3ACTCCCTGTAGCTTGTCT.ATACCA 
TCCTCACAACATAGGAATTGTGATAGCAGCAGACACACCGGAAGGAGCTGGGGAAATCCCACAGAGGGCTCCGCAGGATG 
A/GAGGCGAAT GCCTACAC AG AAGGTGGGGAA3GG AAGC AG AGGGAAC AGO ATGGGCG TGGG ACCACAAGT CT AT TTGGGG 
5 A.AGCT5CCGGTAACCGTATATGGCTGGGGTGAGGGGAGAGGTCATGAGATGAGGCAGG.AAGAGCCACAGCAGGCAGCGGG 
TACGGGCTCCTTAT7GCCAAG.AGGCTCGGATGTTCCTCCTCTTCCTCCTTCCGGGGCTGCCTG7TCATTTTCCACCACTG 
CCTCCCATCCAGGTCTGT3GCTCAGGACATCACCCAGCTGCAGAAA;::T-:‘4GGCATCACCCACGTCCTGAATGCTGCf;GAGG 
GCAGG TCCTT CAT GC ACGT C AAC ACC AG T GCTAG CT TCT AC GAG CAT TCT GG CAT CAC CT ACT T GGC-CAT CAAGG CC AA T 

gatacgcaggagttcaacctcagtgcttactttgaaagggccacagatttcattgaccaggcgctggcccataaaaatgg 
10 taaggaacgtacattccggcacccatggagcgtaagccctctc-ggagctgcttcctccaaagagggccccacttgaaaaa 

GGT7CCAGAAAGATCCCA-AAATATGCCACCAACTAGGGATTAAG7GTCCTACATGTGAGCCGATGGGGGCCAC7GCATAT 
AGTCTGTGCCATAGACATGACAATGGATAATAATATTTCAGACAGAGAGCAGGAGTTAGGTAGCTGTGCTCCT7TCCCTT 
TAATTGAGTGTGCCCATTTTTTTATTCATGTATGTGTATACATGTGTGTGCACACATGCCATAGGTTGATACTGAACACC 
GTCTTCAATCGTTCCCCACCCCACCTTAT7TTTTGAGGCAGGGTCTCTTCCCTGATCCTGGGCCTCATTGGTTTATCTAG 
15 GCTGCTGGCCAGTGAGCTC7GGAG7TCTGCTTTTCTCTACCTCCCTAGCCCTGGGACTGCAGGGGCATGTGc:TGGGCCAG 
GCTTTTAT GT CGCGTTGGGG ATCT G AACT7 AGGTCCCT AGGCCTG AGCACCGT AAAG ACTCT GCCACATCCCCAGCCT GT 
TTGAGCAAGTCAACCA7TCCCCAGAATTCCCCCAGTGGGGCTTTCCTACCCTTTTATTGGCTAGGCATTCATGAGTC-G7C 
ACCTCGCCAGAGC-AAT5.AGTGGCCACGAC7GGCTCAGGGTCAGCAGCCTAGAGATACTGGGTTAAGTCTTCCTGCCGCTC 
GCTCCCTGC\AGCCGCAGACAGAAAGTACCACTGAATGAGAGCTGGCTAG?GGTCAGACAGGACAGAAGGCTGAGAGGGTC 
20 ACAGGGCAGATGTCAGGA3AGCAGACAGG7TC7CCCTCTGTGGGGGAGGGGTGGCCCACTGCAGGTGTAATTGGCCTTCT 
TTGTGCTCCATAGAGGCTTCCTGGGTACACAGCAGCTTCCCTGTCCTGG7GATTCCCAAAGAGAACTCCCTACCACTGGA 
CTT ACAGAAGT TCT ATTGAC TGGTGT AACGGTT CAACAGCTTTGGCTCT TGGTGGACGGTGCAT ACTGCT GT.ATCAGCT C 

AAGAGCTCATTCACGAATGAACACACACACACACACACACACACACACACACACAAGCTAATTTTGATATGCCTTAACTA 
GCTCAGTGACTGGGCA7TTCTGAACATCCCTGAAGTTAGCACACATT7CCCTCTGGTGTTCCTGGCTTAACACCTTCTAA 
25 ATC7ATATTTTATC7TTGC7GCCCTGTTACCTTC7GAGAAGCCCCTAGGGCCAC7TCCCTTCGCACCTACATTGCTGGAT 
GGTTTCTCTCCTGCAGCTCTTAAATCTGATCCCTCTGCCTCTGAGCCATGGGAACAGCCCAATAACTGAGTTAGAaATAA 
AAACGT CrCT AGCC AAAACT T CAGCT AAATT T AG AC AAT AAATCT TAC TGG TT GTGGAAT CCTT AAGAT T CTT CATGACC 

tccttcacatggcacgagtatgaagctttattacaattgtttattgatcaaactaagtcataaaaagccagttgtctttc 

A.CCT GCTCAAGGAAG GAAC AAAATT CAT CCT T AACT GAT C TGTGC ACC T TG CACA.AT CC AT ACG AATAT C TT AAG AGT AC 
30 T AAGATTTT GGTTGTGAG AGT CACAT GTTACAGAAT GTACAGCTTTGACAAf4GTGCAT CCTTGGGATGCCG.AAGTG.ACCT 

GCTGTTCCAGCCCCCTACCTTCTGAGGCTGTTTTGGAAGCAATGCTCTGGAAGC.AACTTTAGGAGGTAGGATGCTGGAAC 

AGC 3G GT CACT T C AGCATCC CGATGACG AAT CC CGT CAAAGCTGT ACAT 7 C TGT AACAGACrGGGAAAG CT GC AG AC T T T 

AAGGCCAGGGCCCTATGGTCCCTCTTAATCCCTGTCACACCCAACCCGAGCCCTTCTCCTCCAGCCGTTCTGTGCTTCTC 
ACTCTGG.AT A.GAT GGAGAA.CACGGCCTT GCTAGTT AAAGGAGTGAGGCTTCACCC’l’TCTCACATGGCAGTGGTTGGTCAT 
35 CCTCATTCAGGGAACTCTGGGGCATTCTGCCTTTACTTCCTCTTTTTGGACTACAGGGAATATATGCTGACTTGTTTTGA 
CCTTGTGTATGGGGAGACTGGATCT7TGGTCTGGAATGTTTCCTGCTAG7TTTTCCCCATCCTTTGGCAAACCCTATCTA 
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TATCTTACCAC?AGGCATAGTGGCCCTC3TTCTwGAGCCTGCCTTCAGGCTG3TTCTCGGCiC3K:CATGTCCCTGGTTTCT 
C C CC AG C A 7 AT G G TOT 7 C AC A GT G T T C ACT GCGGGTGGTTGCTGAAC.AAAGCGGGGA.TTGCA.TCCCAGAGCTCCGGTGCC 
TTGTGGGTACACT-3CTAA.5ATAAAATGGATACTGGCCTCTCTCTGACCAC7TGCAG.AGCTCTGGTGCCTTGTGGGTACAC 
TG C7 AAGA.T AAAATGGA.TACTGGCCT CTCT C7A.TC C.AC7T GCAGG ACT CTA.G GGAAC.AGGAATCC ATTACT G AGAAAACC 
5 AGGGGCTAGGAGCAGGG.AGGTAGCTGGGCAGCTG.AAG7GC7TGGCGACTAACC.AATGAATACCAGAGTTTGGATCTC7AG 
AATACTCT TAAAATCTGGGT GGGCAGAG TGGCCTG CCT 37 AATCCCAGAACTCGGGAGGCGGAGACAGGG AATCAT CAGA 
GCAAACTGGC'rAACCAGAATAGCAAAACACTGAGCTCTl-GGCTCTGTGAGAGATCCTGCCTT.AACATAT.AAGAGAGAGAA 
TAAAACA77GAA‘3AAGACAG7AGATGCCAAT?TTA_AGC7CCCACATGCACATC^ACAAGTGTGCG , I , T'1 , GAACACACA.TAT 
GCACTCA7GTG.AACC.AGGCATGCACACTCGGGCTTA7CACACACATAATTTGA.AA0AGAGAGTGAGAGAGGAGAGTGCAC 
10 AT T AGAG 7T CAC.AGG.AAAGT G7GAGT GAGC AC ACCC AT GC AC.ACAG AC AT G T GT G CC AGGG AG TAG GAAAGGAGCCTGGG 

TTTGTGTATAAGAGGGAGCCATCATGTGTTTCTAAGG.-.GGGCGTGTGAAGGAGGCGTTGTGTGGGCTGGGACTGGAGCAT 

ggttgtaactgagcatgctccctgtgggaaacagg.agggtggccaccctgcagagggtcccactgtc:-::agcgggatcagt 

AAAAGCCCCTGCTGAGAA.CTTTAGGTAATAGCCAGAGAGAG.AA.AGGTAGGAAAGTGGCUSGGACTCCCA.TCTCTGATGTAG 

GAGGAT CT GGGCA.AGT AGAGGT G CGT ? T GAG G T AG AAAG AGGGGT GCA.GAGGAGATGCTCT TAATT CTGGGT CAG CAGTT 
15 T C TT TCC AAAT.AAT GCCT GTGAGG.AGGT G T A GGT ( 4GTGGC CA.T TC ACT C.ACT C AGCA.GAGGGATG ATGATGCCCGGTGG.A 

TGCTGGAAATGGCCGAGCATCAACCCTGGCTCTCG.AAGAACTCCATCTTTCAGAAGGAGAGTGGATCTGTGTATGGCCAG 
CGGGC4TCACAGGTGCTTGGGGCCCCTGGGGGACTCCTAGCACTGGGTGATGTTTATCGAGTGCTCTTGTGTGCCAGGCAC 
TGGCCTGGGGCTTTGTTTCTGTCTCTGTTTTGTTTCGTTTTTTGAGA.CAGACTCTTGCTATGTATCCGTGTCAATCTTGG 
AA.TCTCA.CTGCA.TAGCCCAiGGCTGCGGA.GA.GA.GGGG.AGGGC.AATAGGCCTTG'l’AAGCAAGCC.AC.ACTTC.AGAGACTA.GAC 
20 TCCACCCTGCGAATGATGACA3GTCAGAGCTGAGTTCCGG.AAGATTTTTTTTCCAGCTGCCAGGTGGAGTGTGGAGTGGC 
AG C TAG C G GCAAGG 3T.AG AGGGC G AGCT C CCT G TG C A.GGAG.AAATGC AAGC.AAGAG AT GGC :AAGCCA.GTG.AG T T.AAG CAT 

TCTGTGTGGGGA.GCAGGTGGATGAA.G.AG.AGA.GGCTGGGCTTTCGCCTCTGGGG3GGGGGTGAG3GGTGGGGAT3AGGTGA 

GAGGA.GGGCA.GC7C CCTGCAGT GTG.AT GA.5ATT TTTCCTG.ACA.GTGACCT T TG 3CCTCTCCXT CCCCCACT T C CCT? CTT 
TCCTTTCTTCCCACCATTGCTTTCCTTGTCCTTGAGAAATTCTGAGTTTCCACTTCACTGGTGATGCAGACGGAAACAGA 
25 AGCCGTGTGTGTGT3TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTTTGTGTGTATGTGTGTGTGTGTGTTTGTGT 

gtatgtgtgtcagtgggaatggctcatagtctgcaggaaggtgggcaggaaggaataagctgtaggctgaggcagtgtgg 

GA.TGC.AGGG AGA.G A.3GA.G AGGAGGGA.T ACCA.GAGAA.GGAAAT T AAGGG AGC T ACAAG AGG GCATTGTTGGG GTGTGTGTG 
TGTGTGTGT TGTTT.ATAT TTGTA.T T G GAAAT.ACATT CT T TT.AAAAAAT ACT T A TCC AT TT AT TT AT T T TT ATG TGCA.C3T 

GTGTGTGCCTGCATGAGTTCATGTGTGCCACGTGTGTGCGGGAACCCTTGGAGGCCACAAGGGGGCATCrTGATCCCCTGG 
30 AACTGGAGTTGGAGGAGGTTGTGAGTCCCCTGACATGTTTGCTGGGAACTGAACCCCGGTCCTATGCAAGAGCAC-GAAGT 
GCAGTTATCTGCTGAGCCATCTCTCCAGTCCTGAAATCCATTCTCTTAAAATACACGTGGCAGAGACATGATGGGATTTA 
CGT ATGG ATTT AAT 3T GGCGGTCATT AAGTT CCGGCACA.GGCAAGCACCTGT AAAGCCATCACCACAACCGCAA.CAGTGA 
ATGTGACCATCACCCCCATGTTCTTCATGTCCCCTGTCCCCTCCATCCTCCATTCTCAAGCACCTCTTGCTCTGCCTCTG 
TCGCTGGAGAACAGTGTGCATCTGC.ACACTCTTA.TGTCA.GTGAAGTCACACAGCCTGCACCCCTTCCTGGTCTGAGTA.TT 

35 tgggttctga.ctctgctatcacacactactgTactgcattctctcgctctctttttttaaacatatttttatttgtttgt 

GTGTATGCACAT GT GCCACATGTGT.ACAGAT ACT AT GGA3GCCAGAAGAGGCCATGGCCGTCCCTGGAGCTGGAGT T ACA 
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CGCAGCC5TGTGAGCTGCCTCGTa7CGGTGCTGGG“CaW.CTTG.^TCT>AaGC£ASCACTTTTfACTGCTGACK3C.-.C5-:: 

T CT CAGTA.CCCTT CTTC ATT T CT CCGCCT GGG TTCX :ATTG? ATGGACACAT GTAGCTA.GAA.TATCTTGCTT ATCT.AAT ?A 
TG7 ACATT GTT TTGT GCT.AAGAGAGAGT A.A7GCTCTA7A3CCTGAGC7 GGCCTC.AA CCTT GCCATCCTCCTGCCT CAGCC 
T CCTCCTCCT GAGTGCT AGG AT GAC.AGGCG AGTGGTAACTT ACATGG77 T C ATGT T TTGTTCAAG.ACT GAA3GA.T AAC.AT 
5 TCATACAGAG.AAGGTCTGG3TCAC.AAAGTGTGCAGTTCACTG.AATGGCACAACCCGTGATCA-AGAAACAAAACTCAGGG.''. 
CTGGAGAGATGGCACTGACTGCTCTTCCAGAG6TCCGGAGT70AATTCCCAGC.AACCACATGGTGGCTCACAGCCATCTA 
T.AACGAGATCTGACGCCCT CT 7 CTGGT GTGTCTG.AAGACAGCTACAGTGT ACTCAC AT AAAAT AAAT.AAATCT T TAAAAC 
ACACACACACACACAATTACCACCCCAGAAAGCCCAC?CCATGTTCCC7CCCACG7CTCTGCCTACAGTACTCCCAGG77 
ACCACTGTTC.AGGCTTCTAACAAGCTGGTT7ACTTGGGCCTCTTTTGTGC7CTGTGGAGCCACACATTTGTGTGCCTCA7 
10 ACACGTTCT7TCT AGT AAGTT GCAT ATT ACT C7GCGT IT? T ACATGT ATT 7.ATTT ATTG TA.GTTGTGT G7GCGTGT3GG C 

CCATGCATGGCACAGTGTGTGGGGATGTCAGAGTAT7GTG.AACAGGGGACAGTTC77TTCTTCAATCATGTGGGTTCCAG 
AGGTTGA-.CTCA.GGT CA.TC.AT GTGT GGCAGCAAATGC.C77T ACCCACTGAGACAT ' ITCC AT ATTCTTTTTTTTTCCCCTG 
AGGTGGGGGCTTGTTCCATAGCCCAAACTGGCTTTGCAC7TGCAG77CAAAGTG.AC7CCCTGTC7CCACCTCTTAGAGTA 
TTGGAATTACGATGTGTACTACCACACCTGACTGGATCATTAATTCTTTGATGGGGGCGGGGAAGCGCACATGCTGC.AGG 
1 5 TGAA.GGGATGACTGGACTGGACA.TGAGCGTGGAAGCCAGA.GAA.CAGCTTCA.GTCT.AATGCTCTCCCAAC7GAGCTATTTC 

GGTTTGCCAGAGAACAACTTACAGAAAGTTCTCAGTGCCATGTGGATTCGGGGTTGGAGTTCAACTCATCAGCTTGACAT 

tggctcctctacccactgagccttctcactactctctacctagatcattaattcttttttaaaaagacttattagggggc 

TGGA.GA.GATGGCTCAGCCGTTAAG.AGCACCG.AATGCCCTTCCAG.A3GTCCTC-AGT7CAATTCCCAGCAT3CCAT7GCTGG 
GCA.GTAGGGGGCGCAGGTGTTC.AACGTGAGTAGCTGTTGCCAGTTTTCCGCGGTGGA.GAACCTCTTGACACCCTGCTGTC 
20 CCTGGTC ATTCT GGGT GGGTGCATGGT GAT ATCCTTG TTGT ATGGAAGACTT TGAC7 3T TACAGTGAAGTT GGGC T TCCA 

CAGTTACCACGTCTCCCCTGTTTCTTGCAGGCCGGGTGCTTGTCCATTGCCGCGAGGGCTACAGCCGCTCCCCAACGCTA 
GT TAT CGCCTACCTCAT GAT G CGGCAGAAG A? GGACGT CAAGTCTG C T CT G AGT AC T STG AGG CAG AAT CGTGAGATCGG ' 
CCCCAACGATGGCTTCCTGGCCCAACTCTGCCAGCTCAATGACAGACTAGCCAAGGAGGGCAAGGTGAAACTCTAGGGTG 
CCCACAGCCTCTTTTGCAGAGGTCTGACTGGGAGGGCCCTGGCAGCCATGTTTAG3AAACACAGTATACCCACTCC-CTGC 
25 ACCACCAGACACGTGCCCACA7CTGTCCCAC7CTGGTCCTCGGGGGCCACTCCACCCTTAGGGAGCACATGAAGAAGCTC 
CCTAAGAAGTTCTGCTCCTTAGCCATCCTTTCCTGTAATTTATGTCTCTCCCTGAGGTGAGGTTCAGGTTTATGTCCCTG 
TCTGTGGCATAGATACATCTCAGTGACCCAGGGTGGGAGGGCTATCAGGGTGCATGGCCCGGGACACGGGCACTCTTCAT 
GACCCCTCCCCCACCTGGGTTCTTCCTGTGTGGTCCAG.AACCACGAGCCTGGTAAAG3AACTATGCAAACACAGGCCCTG 
ACCTCCCCATGTCTGTTCCTGGTCCTCACAGCCCGACACGCCCTGCTGAGGCAGACG.AATGACATTAA.GTTCTGAAGCAG 
30 AGT GG AG AT .AG .AT TAG? GACT AGATTT CCAAAAAGAAGGAAAAAAAAG GC TG C AT TT TAAAATT ATTTCCT T AG AATT.AA 

AG.ATACTACATAG3GGCCCTTGGGT.AAGCAAATCCATTTTTCCCAGAGGCTATCTTGATTCTTTGGAATGTTTAAAGTGT 
GCCTTGCCAGAGAGCTTACGATCTATATCTGCTGCTTCAGAGCCTTCCCTGAGGATGGCTCTGTTCCTTTGCTTGTTAGA 
AG AGCGAT GCCTTGGGCA5GG7TTCCCC CTT 7 TCAG AAT AC AGGG T GT AAAGTCC AG CC'i' ATTA.C AAAC AAAC.AAACA.AA 
CAAACAA-ACAAAGG.ACCTCCA.7TTGGAG.AATTGCAAGGATTTTATCCTG.AATTATAGTGTTGGTGAGTTCA.AGTCATC.AC 
35 G CC.AAGTGCTTGCdAT CCTGG7TGCT ATTCTAAGA.AT.AATTAGGAGGAGGAACCT AGCCAATTGCAGCTCATGTCCGTGG 

GT GTG T GC AC GGGTGC A.T.AT G7T GG AAG 3GGTGCCTG T CCC CT TGC CG ACAG.AAGG AAAAT G AAAGGCCCC T CT G CTC AC 
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CCTGGCCA77TACGGGAGGCTCTGCTGGTTCCACGGTCTC7GTGCAGGATCCTGAAACTGACTCGCTGGACAGAAACGAG 
ACTTGGCGGCACCAJGAGAATGGAGAGAGAGAGAGCAAAGAAACiAAACA.GCCT?TAAAAGAACTTTCT.AAGGGTGGTTTT 
TGAACCT CGCT GGACCTT GTATGTGTGCACATTTGCCAGAuATT GAACAT AATCCTCT7GGG ACT TCA.CG T T CT CAT TAT 
TTGTATGTC7CCGGGGTCACGCAGAGCCGTCAGCCACCACCCCAGCACCCGGCACATAGGCGTCTCATAAAAGCCCATTT 
5 T ATGAGA.AC CAGAGCTGTTTG AGT ACCCCGTGT A? AGAG.AGAGTTGTT GT CGTGGGGCACCCGGATCCCAGC AG CCT GGT 

T GC CT GCCT GT AGG A.TGT CT T AC AGGA.GT ?T 3C AG AG.AAACCT T C C TT GG AGG GAAAG AAAT AT CAGG G ATTT T TG TT G A 
AT ATTTCAAATTC AGCTT7 AAGTG7 AAGACT CAGCAGT G77CAT GGTT AAGGTAAGG.AACATGC CTTTTCCAGAGCT G CT 
GCAAGAGGCAGGAGAAGCAGACCTGTCTTAGGATGTCACTCCCAGGGT.AAAGACCTCTGATCACAGCAGGAGCAGAGCTG 
7GCAGCCTGG ATG G7CATTGT CCCCT ATTCT GT GTGAC CACAGCAACCCTGGT CACAT AGGGCT GGTCAT CCT? TTTT TT 
i 0 TTTTTTTTTTTTTTTTTT TGGCCCAGAAT G.AAGTG ACC AT A3CC.AAGTTGTGT ACCTCAGTCTT T AGTTT CC.AAGCGGCT 

CTCTTGCTCAATACAATGTGCATTTC.AAAATAACACTGTAGAGTTGACACAACTGGTTCATGTGTTATGAGAGAGGAAAA 

G AGAGG AAAG.AAC.AAAACAAAAC AAAAC AC C ACAAA CC AAAAAC AT CT GGGCT AG CC AGGC.AT G AT TGC AAT GT C T ACAG 

GCCCAGT7CATGAGA.GGCAGA.GAC.AGGAAGACCGCCGAA-AGGTCA-AGGATAGCA7GGTC7ACGTATCGAG.ACTCCAGCCA. 
GGGCTACGGTCCCAAGATCCTA.GGTTTTGGATTTTGGGCTTTGGTTTTTGAGACAGGGTTTCTCTGTGTAGCCCTGGCTG 
15 T CCT GGAACT CGCTCTGTAG ACCAGG( :TGGCCTC AAAC T T.AG AG AT CTGCCTGACTCT GCCTTTGAGGG CTGGGACGAAT 

GCCACCACTGCCCAACTAAGATTCCATTAAAAAAAAAAAAAGTTCAAGATAATTAAGAGTTGCCAGCTCGTTAAAGC ;TAA 
GT AGAAGC AG7 CTC.AGGCCT GCTGCTTGAGGCTGTT CTT GGCT TGG ACCTG AAATCTGCCCCCAAC AGTGTCC.AAGTGCA. 
CATGACTT7GAGCCATCTCCAGAGAAGGAA.GTGAAAATTGTGGCTCCCCAGTCGATTGGGACACAGTCTCTCTTTGTCTA 
G3TAACACAT3GTGACACA.TAGCA.T7GAACTCTCCACTCTGAGGGTGGGTTTCCCTCCCCCTGCCTCTTCTGGGTTGGTC 
20 ACCCCATAGGACAC-CCACAGGACAGTCACTAGCACCTACTGGAAACCTCTTTGTGGGAACATGAAGAAAGAGCCTTTGGG 
AGATTCCTGGCTTTCCATTAGGGCTGAA n .G7ACAA.CGGTTCTTGG7TGGCTTTGCCTCG7GTTTATAAAACTAGCTACTA 
TTCI'TCAGGTAAAAT.ACCGATGTTGTGGAAAAGCCAACCCCGTGGCTGCCCGTGAGTAGGGGGTGGGGTTGGGAATCCTG 

G.AT AGT GT TCT ATCCA.TGGAAAGT GGT GGAAT AGGAATTAAG3GTG7 TCCCCCCCCCCCC AACCTCTTCCT CA.GA.CCCA.G 
CCACTT TCTATGACT T ATAAACATCCAGGT AAAAATTACAAACATAAAAATGGT TTCTCTTCTCAAT CT TCT.AAAGTCTG 
25 CCTGCCTT TTCCAGGGGTAGGTCTGT TTCTTTGCT GTTC TAT TGT CT TGAG AGC ACAGA.CT AAC ACT TACCAA ATGAGGG 

AACTCTTGoCCCA/r.ACTAAGGC7CTTCTGGGCTCCA.GCACTCTTAAGTTA.?TTTAAGAATTCTCACTTGGCCTTTA.GCAC 

A.CCCGCCACC CC CAAG T GGGT G TGG AT AAT GCC AT GG CC AGC AGG GGGC ACT GTTGAtiGCGGGTGCCTTTCCACC TT AAG 
TTGCTT AT AGT ATTT AAGATGCT AAATGTTTT AATCAAGA.GAAGCACTG.ATCTTA.T AAT ACGAGG ATAAGAGAT TT TCTC 
ACACGAAATTGT CTT TT TCAT AATT CT TTT AC AGGCT TT G TCCT G AT C GT AGCAT AG AG AG AATAGCTGG AT AT T T AACT 

30 TGTATTCCATTTTCCTCTGCCAGCGTTAGGTTAACTCCGTAAAAAGTGATTCACTGGACCGAAGAGGCTCAGAGGGCAGG 
GGAT GGTGGGGTGAGGC AGAGCACTGTCACC7GCC AGGCA/TGGGA.GGTCCTGCCATC:CGGGA.GGAAAAGGAAAGTT T AGC 
CTCT AGTCT ACCACCAGTGTT .AACGC ACTCT.AAAGTTGT, AACCAAAATAAATGTCTTACATTACAAAGACGTCTG7TTTG 
TCTTTCCTTTTGTGTGTTTGGGCTTTTTATGTGTGCTTTAT.AACTGCTGTGGTGGTGCTGTTGTTAGTTTTGAGGTAGGA 

tctcaggctggccttgaacttctgatcgcctgcccctgcccctgcccctgcccctgtccctgcctccaagtgct.aggact 

AAAAGCACAT G C CAC CACACC AGT AC A.GC AT 7 T TT CT AAC AT T TAAAAAT AAT CACCTAGGG G CTGG AG AGAGGG ? T CC A 

GCTAAGAGTGCACACTGCTCTTG3GTA.G3ACCT3AGTTTAGTTCCCAGAACCTATACTGGGTGGCTCCAGGTCCAGAGGA 
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TCCAGGA.CCTCTG6CCTCC.ATCA\A3CATCTGCTCTT.AGCAC.A7ACCCACATA.CA.uA7.ACA.C.AC.ATA.A.AAATA.AA. 






CTTTAAAAAGCTCCTAAAACCT.AGCCCTTGGAGGTA.CG.ACTCTGGAAAGCToGCATACTGTiiTAAGTCCATCTCA.TGGTG 

T i CT uGCT AACuT AA.u A.C T T AL. J . vr . a ^AC AG.AAAAG A>.e 7 C AGGGT G T GCT GGGGG T T GG G A T GG AGG AA.G A.G GGA.TG A.GT 

AGGGGGAGCACGGGG.AACTTGGGG.AGTGAAAA.TTCTTTGCAGG.ACACTAGAGGAGGA.TAAA.TACCAIiTCATTGCACCCAC 
5 TACTGGAC.AACTCCAGGG.AATTATGCTGGGTGAAAAGAGAAGGCCCCAGGTATTGGCTGCATTCGCTGCATTTGCGTAAC 

AT TTTTTT AAATTGAAAAG AAAV vGA.T CTAAA7 CAA3 2 7TAG A7GAGTGGT TGCT GTGAGCTGAGAGCTGGGGTGAGTGA 

gacatgtggacaactccatcaaaagcgacagaaagaacgggctgtggtgacagctacctctaatctccacctccgggag 

GTG AT CAAGG TT A.GCCCT CAGCT.AGCCTGTGGTGCAT <3 A3.ACCCTGT TTC.AAAAA CTTTAAT AA.AG AAATAA.TGAAAAA.A 

GACATCAGGGCAGATCCTTGGGGCCAAAGGCGGACAGGCGAGTCTCGTGGTAAGGTCGTGTAGAAGCGGATGCATGAGCA 
10 CGTGCCGCAGGCA7CATGAGAGAGCGCTAGGT.AAGTAAG3ATGGATGTGAGTGTGTCGGCGTCGGCGCACTGCACGTCGT 
uTGGTGuTGG.-.^T uGCATC • : : GGTGA3 CTGTGGA.ijGuGAAATGGGTA.GGGA.GATCATAAAA.TCCCTCCGAATT.AT 
TTCAA.GAA.CTGTCTA.TT.AC.AATTA.TCTCAAAA.TATTAAAAAAAAAGAA.GAA.TTAAAAAACAAAAAA.CCTATCCAGGTGTG 
GTGGTGTGCACCTATAGCCACGG3CACTTGGAAA.5CTGGAGCAAGAGGATGGCGAGTTTGAAGGTATCTGGGGCTGTACA 
GCAA.GACCGTCGTCCCCAAA.CCAAA.CCAAACA.GCAAACGCATTATGTCACACAAGAGTGTTTATAGTGAGCGGCCTCGCT 
15 GAGAGCATGGGGTGGGGGTGuGGGTGGGGGAC.AGAAA.TATCTAAACTGCAGTCAAT.AGGGATCCACTGAGACCCTC-GGGC 

ttgactgcagcttaaccttgggaaatgataagggttttgtgttgagtaaaagcatcgattactgactt.aacctcaaatga 

AGAAAAA.GAAAAAAA.GAAAA.CAA.CAAAA.GC'C.A-A.CCAA.GGGGCTGGTGAGATGGCTCAGTGGGTAAGAGCACCCGACTGC 
TCTTCCGAAGGTCCAGAGTTCAAATCCCAGC.AACCACATGGTGGCTCACAACCATCTGT.AACGAGAT.ATGATGCCCTCTT 
CT^GTuTGT'wT-j.-^.GA.CAGCTACAGTGTACTTACATAT.-_ATAAATAAA.TCTTAAAAAAA.AAAAA aaaaaaaaaa.gccaaa. 
20 CCGAGCAAA.CCAGGCCCCCAAACAG.AAGGCAGC-CACGACGGCAGGCACCACGAGCCATCCTGTGAAAAGGCAGGGCTACC 

CATGG GCCGAGGAGGGTC CAGAG A GAT AGGCTGGT A.AG CTCAGTTTCT CT GTATACC CTTTT TCTTGT TGACACTACTT C 
AAT T AC AG AT AAAA7 AACAAAT AAA.C AAAA.T CT AGAG CCTGG CC A.CTCT C TG C? CGC T T G AT T T T TCCT G T T ACG TCCAG 
CAGGTuuCuGAAGT oT TCvAAuG.“.C. !S .SA.T CGCAT CA.TT A-AGGTGGCCA.GC AT AA.T CTCCCATCA.GCAGGTGGTGCTGTG A 
G AACC ATTATGGTG CTCACAG.AAT CCGGGGCCC AGG AGCTGCCCTCTCCC AA.G TCTGGAG CAA.TAGGAAA.GCT TT CTGGC 

25 CCAGA.CAGGGTTAACAGTCCACA.TTCCAGAGCAGGG3.AAAAGGAGA.CTGGA.GGTC.ACA.GACAAAA.GGGCCAGCTTCTA.AC 
AACTTCACA.GCTCTGGTA.GGA.GAGATAGATCACCCCC.AAC.AATGGCCA.CAGCTGGTTTTGTCTGCCCCGAAGG.AAACTGA 



CTTAGGAAGCA.GuTATCAGAGTCCCCTTCCTGAGGGGA.CTTCTGTCTGCCTTGTAAAGCTGT CAGAGCA.GCTGCATTGA.T 



GT GT G G GTG AC A.3.AAG .ATG AAAA.G G.AGGACC CAGGC AG A T CGCC A.CAG A? GGACC GGCC ACTTA.CA.AGT CG AG GCAGGTG 

GCAGAGCCTTGCA.GAA.GCTCTGCAGGTGGACGACACTGA.TTCATTACCCAGTTA.GCATACCA.CAGCGGGCTAGGCGGACC 
30 ACAGCCTCCT7CCCA.GTCTTCCTCCAGGGCTGGGGAGTCCTCCAACCTTCTGTCTCAGTGCAGCTTCCGCCAGCCCCTCC 

TCCTTTTGC.ACCTCAGGTGTGAACCCT CCCT CCT CT CCTT CTCCCTGT GGCAT GGCCCTCCTGCT A.CTGC AGGCTGAGCA. 
T TGGAT TT CT TT uT GCT T A.G AT A.GA.CCTGAGATGGC TTT C TG AT T T A.T A.T A.TAT AT AT CC A.T CC CT TGG A.T CTTAC AT CT 
AGGA.CC CAGAGCTG TTTuT GAT ACC A.T AAG AG 3C TG GGG AGAT G A.T A.TGGT AA.GAG TGCT TGCTGT ACAAGCATG AAGAC 
ATG AGT T CGAA.TCCCC A.GC.AACCAT GT GG AAAAAT AACCT T CTAA.CCTC A.G A.G T TG.AG GGG.AAAGGCAC C T G GA.T T CT GG 

35 ’■GGGCTTACTGGC«GCTAGCCAGCC?MCCTA B ATCTCTa».GTO.GAGATCCTGTCTCAGGCy>ATA®.CTTGGGAGA.«LTGA 

CTGAGAAAGACACCT CCTC AGGTCTCCCAT GCAC CC ACAC AGACACACGGG GGG GGGG T AATG7AAT AAGCT AAGAAAT A 
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ATGAC^CAAATGATTTTTTSCTAAGAAATGAAATTCTI-TGrrGGCCGCAAGAAGCCTGSCCAC-GGAAGGAACTCXJCTTTG 
GCACACCAGCCTATAAG7CACCAT3AG7TCCC7GGC7AASAA7CACATGTAATOGAGCGCAGGTCCCTCT7GCCTGGTGG 
TTGCCTCTCXCACT3GT7TTGAAGAGA a ATTCAA3AGA3AT(7rcr71’TGGTCAGAATTGTAGGT3CTGAGCAATGTGGAGC 
TGGGGTCAATGGGATTCCTTTAAAGGCATCCTTCCCAGGGCTGGGTCATACTTC.aATAGTAGGGTGCTTGCACAGCAAGC 
5 GT GAGA.CCCT AGGT T AG.AGT CCCCAGAAT C7 GCCCCC -ACCCC CCAAAnAG GC AT CCTTCTGCCTCTGGGTGGGTGG GGG 

GAGCAAACACCTTT.AACTAAGACCATT AGC7 GGCAGG 3 GT AACAAAT GA3CTT GGCTA.GAGG.AATT TGGT CAAG CT GGAT 
TCCGCCTTCTGTAGA a GCCCCACTTGTTTCCT?TGTTAA3C7GGCCCAC:AGT‘i*TGTTTTGAGAA.TGCCTGAGGGGCCC:AG 
GGAGCCAGACAAT T AAAAGCC AAGCTC AT T 7 T GAT AT CTG/AAACCVJ AGC CT G ACT GCCC ? 3 3 CC G TGG GAG GTAOTGG 

gagagctggctgtgtccctgcctcaccoacgccccccccccc.aacagacactcctcgggtcagctgggaggtgccagcag 
10 caatttggaagtttactgagcttgagaagtcttgggagggctgacgctaagcacaccccttctccacccccccccacccc 

ACCCCC GTG AGG AGGAGGGT GAGG.AAAC AT GGGACC AG CCCT GCT CC A;GCCCG TCC 7 T AT T GG CT G GC AT GAG GC AG AGG 
GGGCTTTAAAAAGGCAACCGTATCTAGGCTGGACACTGGAGCCTGTGCTACCGAGTGCCCTCCTCCACCTGGCAGCATGC 
AGCCCTCACTAGCCCCGTGCCTCATCTGCCTACTTGTGCACGCTGCCTTCrTGrGCTGTGGAviGGC'CAGGGGTGGCA-AGCC 
TTCAGGAAiTiATGCCACAGAGGTCATCCCAGGGCTTGCAGAGTACCCCGAGOCTCCTCCTCAGAAC.AACCAGACCATGAA 
15 CCGGGCGGAGAATGGAGGCAGACCTCCCCACCATCCCTATGACGCCAAAGGTACoGGATGAAGAAGCACATTAGTGGGGG 

ggggggtcctggga.ggtgactggggtggttttagcatcttcttc.agaggtttgtgtgggtggctagcctctgctaca.tca 

GGv CAGGG AC AC AT TT G CCT6G AAG AAT ACT AGCACAG CATTAGAACCTGGAGG3C AGCATT GGGGGGCTGGT AGAG AGC 
ACCCAA.GGCAGGGTGGAGGCTGAGGTCAGCCXiAAGCVi’GGCATT.AACACGGGCATGGGCTTGTATGATGGTCCAGAG.AATC 
TCCTCCT.AAGGATGAGGACACAGGTCAGATCTAGCTGCTGACCAGTGGGGAAGT3ATATGGTGAGGCTGGATGCCAGATG 

20 ccatccatggctgtactatatcccacatgaccaccacatgaggtaaagaaggccccagcttgaa.gatggagaaaccgaga 

GGCTCCT GAG AT AAA G T CACCT GGG AGT AAGAAGAGCT G AG.ACTGGAAGCTG 3T T T 3.AT CCAGAT G C AAGGCAACCCT AG 
ATTGGGTTTGGGTGGGAACCTGAAGCCAGGAGGAATCCCTTTAGTTCCCCCTTGGCCAGGGTCTGCTCAATGA-GCCCAGA 
GGGTTAGCATTAAAAGAACAGGGTTTGTAGGTGGCATGTGACATGAGGGGCAGCTGAGTGAAATGTCCCCTGTATGAGCA 
CAGGTGGCACCACTTGCCCTGAGCTTGCACCCTGACCCCAGCTTTGCCTCATTCCTGAGGACAGCAGAAAC7GTGGAGGC 
25 AGAGCCAGCACAGAGAGATGCCTGGGGTGGGGGTGGGGGTATCACGCACGGAACTAGCAGCAATGAATGGGGTGGGGTGG 

CAGCTGGAGGGAC.ACT CCAGAGA.AAT GACCTT GCT GGT CACCATT 7GTGTGGGA3GAGAGCT CATTTTCCAGCTT GCCAC 
CAC ATGCT GTCCCTCCTGTCTCCT AGC C AGT AAGGGAT GTGG AGG AAAGGGCCA.CC CCAAAG GAG CAT *3 CAATGCAGTCA 
CGTTTTTGCAG.AGGAAGTGCTTGACCTAA.GGGCACTATTCTTGG.AAAGCCCCAAAACTAG’rCCTTCCCTGGGCAAACAGG 
CCTCCCCCACATACCACCTCTGCAGGGGTGAGTAAATTAAGCC.AGCC.ACAGAAGGGTGGCAAGGCCTACACCTCCCCCCT 
30 GTTGTGCCCCCCCCCCCCCCGTGAAGGTGCATCCTGGCCTCTGCCCCTCTGGCTTTGGTACTGGGATTTTTTTTTTCCTT 
TT AT GT CAT ATT G ATC CT G ACAC CAT GG AACTTTTGGAG GT AG AC AGG ACCCACACATGG ATT.AGTT AAAAGCCT C C CAT 

CCATCTAAGCTCATGGTAGGAGATAGAGCATGTCC.AAGAGAGGAGGGCAGGCATCAGACCTAGAAGATATGGCTGGGCAT 
CCAACCCAATCT CC TTCCCCGGAGAACAGACTCT AAGTC AGATCCAGCCACCCTTGAGTAA CCAGCT CAAGGT ACACAGA 
ACAAG AGAGT CT GG T AT AC AGCAGGT GC7 AAACAAAT G CTTG T G G T AGC AA AAGC JT AT AGG7TTTGGGTCAG AACT C CGA 

35 CCCAAGTCGCGAGTGAAGAGCGAAAGGCCCTCTACTCGCCACCGCCCCGCCCCCACC7GGGGTCCTATAACAGATCACTT 

TCACCCTT GCGGGAGCC AGA.GAGCCCTGGCAT CCT AGGT AGCCCCCCCCGCCCCCCCCCCGCAAGCAGCCCP.GCCCTGCC ‘ 
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T?TGGGGC/^GTTCTTT?CTC^CC7GGACCT3TGATAATO;W3GC;GC;TTGGACGC0CCGCCTTTGGTCGCT 
.AATGAATTCT TATC CCTA.CC ACC7 G CCC M'TC’I ‘ACCCCGC7 CCTCCACAGC AGCT GTCCTGATTT A7TACC T 
C7CCACTCCTTTCTCCATCTCC?GGGATACCGCCCCT3TCCCAGTGGCTGGTAAAC<AGCT7AGGAAGGAC 



TTCAA.G7CT 

TCAAT7AAC 

CAGAGCJGAG 



GTGTGGCTAGAXiGCTACCAGGCAGGGCTGGGGATGA.GGAGCT.HAACTGGAAGA.GTGTTTGGTTAGTAGGCACA.AAGCCTT 
5 GGGTGGGA.TCCCTAGTACCGGAjGA^GTGGAGATGGGCGCTGAGAAGTTCAAGACCATCCATCCTTAACTAC.ACAGCCAG'T 
77GAGGCCAGCCTGGGCTACATAAAAACCCAATCTCAAOAGCTGC:C.AATTCTGAT?CTG7GCCACGTAGTGCCCGATG7A 
ATAGTGGATGAAG7CGTTGAATCC7GGGGC;ACCTATTT7ACAGATGToGGGAAAAGCAACTTTAAGTACCCTGCCC:AGA 
C*AT::ACAAAGAAA37AA.GTGACAGAC.CTCCAGTGTTTCATCCCTGCA~TTCCAAGGACAGGGAGAGAGA.GCC:AGGGTC-GG 
AT CTCAC7 GCT CCGCGGTGCCTCCTT CCT AT AATCCATACAGATT CGAAAGCGCAGG3CAGGTTTGGAAAAASAGAG AAG 

10 GGTGGAAGGAGCAGACCAGTCrGGGCTAGGCTGCAGCCrCTCACGCATCCCTCTCTCCGCAGATGTGTCCGAGTACAGCT 
GCCGCGAGC7GCACTACACCCGC77CCTGACAGACCCCCCATGCCGCAGCGCCAAGCCGGTCACCGAGTTGGTGTGC7CC 
GGCCAGTGCGGCCCCGCGCGGCTGCTGCCCAA.CGCCATCGGGCGCGTC-.AAGTGGTGGCGCCCGA.ACGG.-.CCGGATTrCCG 

CT GCAT CCCGG AT CG C? A.CCG CGCGC.AGCGGGTGC AG C 7GCT GT G CCCCGGGG3C GCGGCGCCGCGCTCG CGCAA.GGTGC 
C-T CT GGTGGCC 7CGTGCAAGT GCAAG CGCCTCACC CGC7T CCACAACCAGT CG GAGCTCAAGGACT TCGGGC CG GAG ACC 

1 5 GC.'G'.GGCC^GCAG.AAGGGTCGCAAGCCGCGGCCCGGCGCCCGGGGAGCCAAAGCCAACCAGGCGGAGCTGGAGA acgccta 

C7 AG AG C G AGCCCGC G CCT AT G C AG C C CCCG C G CG AT CCG ATTCGTTT TCAGTGT AAAG( :CTGCAGCCCAGG CCAGG3GT 



20 



GCCAAACTTTCCAGAC.'CGTGTGGAGTTCCCAGCCCA' 



*-«A .GA .CCGCA .^GT GOTT CTGCCCGCTGCGGGGGA.TGGGG.AGG 



G33TGGGGTTCCCGCGGGCCAGGAGAGGAAGCTTGA.GTCCCAGAC7CTGCCTAGCCCC3GGTGGGATGGGGGTCTTTCTA. 



CCC7CGCCGGACCTATACAGGACAAGGCAGTGTTTCCACCTTA.AAGGGAAGGGAGTGTGGA.ACGAAAGACCTGGGACTGG 



TTATGGACGTACAGTAAGATCTACTCCTTCCACCCAAATGTAAAGCCTGCGTGGGCTAGATAGGGT77CTGACCCTGACC 

T3GCCACTGAGTGTGATGTTGGGCTACGTGGTTCT'CT , [TTGGTACGGTCTTC7TTGTAA.AATAGGGACC3GAACTCTGCT 

gagattccaaggattggggtaccccgtgtacactggtgagagagaggagaacaggggaggggttaggggagagattgtgg 

TGGGCAACCGCCTAGAAlGAAGCTGTTTGTTGGCTCCCAGCCTCGCCGCCTCAGAGGTTTGGCTTCCCCCACTCCTTCCTC 

TCAAATCTGCCT7CAAATCCATATC.TGGGATAGG3AAGGCCAG33TCCGAGAGATGGTGGAAGGGCCAGAAATCACACTC 

CTGGCCCCCCGAwAG.AGC.AGTGTCCCGCCCCCAACTGCCTTGTC.ATATTGTAA-AGGGATTTTCTACACAACA.GTTTAAGGT 

CGTTGGAGGAAACTGGGCTTGCCAGTCACCTCCCATCCTTGTCCCTTGCCAGGACACCACCTCCTGCCTGCCACCCACGG 

ACA.CA.T T T CTGT CT .AG AAAC AGA.G CGT CGTCGT G CT G7CCTCT G.AGA.C AGC AT ATCT 7 ACATT AAAAAG AAT.AATACGGG 

GGGGGGGGGCGGAGGGCGCAAGTGTT.A7A.CATA.TGCTG.AG.AAGCTGTCAGGCGCCACAGCACCACCCACAATCTTTTTGT 



AAATC A.T T T CC AGAC ACCT CT T AC TTT CT G TG T AGAT7 T T AAT TGTT AAAAGGG GAGG AGAG AGAGCGT TTGT AACAGAA 
3 0 GCACATGGAGGGGGGGG7 AGGGGGGTT GG GGCT G GT GAG TT T GGCGAACT T T CC AT GT GAG AC T CAT CC AC AAA3ACT G A 

AAGCCGCGTTTTTTTTTTTAAGAGTTCAGTGACATATTTATTTTCTCATTTAAGTTATTTATGCCAACATTTTTTTCTTG 

TAGAGAAAGGCAGTGTTAATATCGCTTTGTGAAGCACAACTGTGTGTGGTTTTTTGTTTTTTGTTTTTTCCCCGACCAGa' 

GGC AT T GT T AAT AAAGAC AATGAAT CT CG AGC AG GAGGC TGTGGTC IT GT TT TG T C.AACCACACACAAT GT CT CGC CACT 

GTCATCTCACTCCCTTCCCTTGGTCACAAGACCCAAACCTTGACAACACC7CCGACTGCTCTCTGGTAGCCC7TGTGGCA 
35 ATACGTGTTTCCTTToAAAAGTCACATTCATCCTTTCCTTTGCAAACCTGGCTCTGATTCCCCAGCTGGGTCATCGTCAt' 
ACCCTCACCCCAGCCTCCCTTTAGCTGACCACTCTCCACACTGTCTTCCAAAAGT3CACGTTTCACCGAGCCAGTTCCCT 
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GG7CCAGGTC AT -CCATTGCT CCTCCT TGCT CCAGACCCTTCT CCCACAAAGA.T GTTC ATCTCCCACT C( :A.TC.AA3CCCC 
AGTGGCCCTGCGGCTATCCCTGTCTCTTCAGTTAGCTG.AATCTACTTGCTGACACCACATG.AATTCCTTCCCCTGTCTTA 
AGGTTCATGGAACTCTTSCCTGCCGCTGAACCTTCCA.GGA.CTGTCCCAGCGTCTGA7GTGTCCTCTCTCTTGT.AAAGCCC 
CACCCCAC T ATT TGATTCCCAA.TTCT AGAT CT TCCC TT G'J'T CATT CCTTCACC-GG ATAGT GTCT CATCTGGCC.AAGTCC T 
5 GCTTGATATTGGGATA^ATGC.AAAGCCAAGTACAATTGAGGACCA.GTTCATCATTGGGCCAAGCTTTTTC.AAAATGTGAA 
TTTTAC ACCT AT AGAAGTGT .AAAAGCCTTCCAAAGC AGAGGCA.ATGCCTGGCTCTT CCT T CAACATCAGGGCT CCTGCTT 
TATGGGTCTGGTGGGGTAGTACATTCATAAACCCAACACTAGGGGTGTG.AAAGCAAGATGATTGGGAGTTCGAGGOCAAT 
CTTGGCTATGAGGCCCTGTCTCAACCTCTCCTCCCTCCCTCCAGGGTTTTGTTTTGTTTTGTTTTTTTGATTTGAAACTG 
C AACAC7TT AAAT CCA.GTCAAGTGCATCTTT GCGT GA3GGG AACTCTATCCCTAAT ATA-AGCTTCCATCTT GAT 7T GT GT 

10 ATGTGCACACTGGGGGTT3AACCTGGGCCTTTGTACCTGCCGGGCAAGCTCTCTACTGCTCTA.AACCCAGCCCTCACTGG 

CTTTCTGT7T C.AACT CCCAATGAATTCCCCT AAAT G AATTA7CAAT ATCATGTCT T 7GAAAAAT ACC ATT G AGT GCTGCT 
GGT GTCCCTGT GGTT CCAGAT T CCAGG AAGG AC TT T T C AGGG AAT ( :CAGGCATCCTGAAGAATGT CTT.AGAGCAGGAGGC 

CATGGAG.ACCTTGGCCAGCCCCACAAGGCAGTGTGGTGCA.GAGGGTGAGGATGGA.GGCAGGCTTGCAATTGAAGCTGAGA 

CAGGGTACTCAGGATTAAAAAGCTTCCCCCAAAACAATTCCAAGATCAGTTCCTGGTACTTGCACCTGTTCAGCTATGCA 

1 5 G AGCCCAGTGGGCAT AG G T GAAGACA CCGGTT GT ACT GT CAT G TACTAAC T GTG CT T CAGAG CCGGCAGAGACAAA.TAAT 

GTTATGC.TGACCCCAGGGGACAGTGATTCCAGAAGGAACACAGAAGAGAGTGCTGCTAGAGGCTGCCTGAAGC4AGAA3GG 

GTCCCAG AC TC T CT AAGC AAAGACTCCACTC AC AT AAAGAGAC AGGCTG AG C AG AC CTG G CCGTGG ATGC AG GG A.GCCC A 

tccacca?cctttagcatgcccttgtattcccatcacatgccagggatgaggggcatcagagagtcc:aagtgatgcccaa. 

A.CC CAAACACACCT AGGACTT GCTTT CTGGGA.CAjGACAGATGCAGGA.GAGACTAGGTTGGGCTGTGATCCCATTACCACA 
20 AAGAGGGAAA fi AACAAAAAACAAACAAA.CAAACAAAAAAAAACAAAACAAAACA a A AAAAA AC CC AAGGTCC AAAT T GT A 

GGT C AGG T T AGAG T TT AT T TAT GG AAAGT TAT ATT C T A CCT C C ATGGGG7C T.ACAAGGCT GGCGCCC ATC AG AAAG.AAC A 

AACAACAGGCTGATCTGGGAGSGGTGGTACTC7ATCGCAGGGA.GCACGTGTSCTTGGG3TACAGCCAG.ACACGGGGCTTG . 
TATTAATCACAGGGCTTGTATTAATAGGCTGAGAGTCAAGCAGACAGAGAGACAG G AGGA_AACACACACACACACAC.ACA 

C AC ACAC AC ACACACACAC AC AT GCACACAC CAC TC ACT T CT C ACT CG AAG AG CCCCT AC TT AC AT TCT AAGAACAAACC 
25 ATTCCTCCT CAT AAAGGA.GACAAAGTT GC AG AAA CC C.AAA AG AGC CAC AG GGTCCCCACTCTCTTTG AAA T GACTTGGAC 

TTG TTG C AG GGAAG ACAG AGG GGT CTGC AGAGGCTT CC TGGGT G ACCC AGAGCCAC AG ACA.C T G AAATCT GGT GCT GAGA 

CCTGTATAAACCCTCTTCCACAGGTTCCCTGAAAGGA.GCCCACATTCCCCAACCCTGTCTCCTGACCACTGAGGATGAG.A 

GCACTTGGGCCTTCCCCATTCTTGGAGTGCACCCTGGTTTCCCCATCTGAGGGCACATGAGGTCTCAGGTCTTGGGAAAG 

TTCCACAAGTATTGAAAGTGTTCTTGTTTTGTTTGTGATTTAATTTAGGTGTATGAGTGCTTTTGCTTGAATATPTGCCT 

3 0 'oTG TA.* jC. u .T T TAC.AA uC CT GGT G CC T G AGGAG AT CAG.AAG AT G G CAT C AGA TACCCTGG AAC T GGA.C TTGC AGAC AGTT A 

TGAGCCACTGTGTGGGTGCTAGGAACAGAACCTGGATCCTCCGGAAGAGCAGACAGCCAGCGCTCTTAGCCACTAAGCCA 

TCACTGAGGTTCTTTCTGT GGCT AAAG AGACAGGAG AC AAAGG AG AG TTTCTTT TAGTCA ATAGGACCATG AAT GT TCCT 

CGTAACGTGAGACTA.GGGCAGGGTGATCCCCCAGTGACACCGATGGCCCTGTGTAGTTATTAGCAGCTCTAGTCTT.ATTC 

CT TAATAAGTCCCAG TT T GG3GCAGGAGAT AT GT ATT C CCTGCTT TGAAGT G G CTG AGG TCCAGT T AT CT ACTT CCAAGT 

35 ACTTt3TTTCTCTT?CTGGAGTTCGGG.tiAGCrCCCT3CCTGCCTGT.W.TGT0TCCATTCTTCAACCTTAGACAA3?.TCAC 

tttccctgagcagtcaggccagtccaaagcccttcaatttagctttcataaggaacaccccttttgttgggtggaggtag 
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CACTTGCCTTa.^TCCCAGCAT?aACMe(3CASAGftCAGTC3GATCTC"or3>.«TTCAC.t'.5CC?.GCCT.3--TCTACGnftOT 
GA.GTTCCAAGACAGCCAGGCCTA.CACAGAGAAAOCCI^GTCTCG AAAAAAACAAAA ACAA A.AGAAATA ‘AAGA a AAA G AAA A 
■CAAAAACGAACAAACAGAAAAACAAGCCAGAGTGTTTGTCCCCGTATTTTATTAATCAT.ATTTTTGTCCCTTTGCCATTT 
T AG ACT AAAAGACT CGGG AAAGCAGGT CTCTCTCT GT TTCTCATCCGGACACACCC.AGAACCAGATGT ATG GAAG A7GGC 
5 T AA7 GTGCT GCAGTTGCACATCT GGGGC7GGGT GG ATT GGTT AG ATGGCAT GGGCTGG GTGT GGTT ACGAT G ACTGC A3G 

AGCAAGC-.AGT AT GT GGT GC.AT AGCAAACGAGG AAGTTTG CACAG AAC AACACT GTGTGTACT GATGTG C.AGG T ATGGGCA 
CA7GCAAGCAGAAGCCAAGGGACAGCCTTAGGGTAGTCTTTCCACAGACCCCTCCCCCCTTTTAACATGGGCATCTCTCA 
TTGGCCTGGAGCTTGCC.AACTGGGCTGGGCTGGCT.AGCTTGTAGG'l'CCCAGGGATCTGCATATCTCTGCCTCCCTAGTGC 

TGGG ATT ACAGTC AT AT ATGAGC AC ACCTGGCTTT TT T.ATGT GGGTT CTG-GG CT TT GAACCC AGATCTGAGTG CTT GCAA 
1 0 GGCAATCG GTTG AATGACTGCTT CATCT CCCCAGACCC? GGGATTCT.ACT 77CTAT T AAAGT ATTTCT ATT AAATC.AATG 

Av.'CCCCT-oCCCCTGCACTCAGCAGTTCTTA.GGCCTGCTGACAGTC.AAGTGGGG.AGTGAGAGCAAGCCTCGAGACCCCATC 
AGCGAAGCAGAGGACAAAGAAATGAAAACTTGGGATTCGAGGCTCGi-IGArATGGAGATACAG.AAAGGGTCAGGGAAGGAA 
ATGAACCAGATGAATAGAG3CAGGAAGGGTAGGGCCCTGCATACATGGAACCTGGTGTACATGTTATCTGCATGGGGTTT 
GCATTGCAATGGCTCTTCAGCAGGTTCACCACACTGGGAAACAGAAGCCAAAAAGAAGAGTAGGTGGTGTTG3AGTCAGA 
15 TACTGTCAGTCATGCCTGAAGAA_A?GGPAGCAA?TA a .CGATGCGCCC-C.AATTAGGATAT?AGCTCCC7G.AAG.AAAGGC^A 
GAAGCTGGGCTGTGGGCACTG.AAGG3AGCTTTGAATGATGTCACATTCTC7GTATGCCTAGCAiiGt:CAGTAT7GGAGACT 
GAGACTTGACTTGTGTGTCCATATGATTCCTCCTTTTCCTACAGTCATCTGGGGCTCCTGACCTTCGTCCTTGTCCAAGA 
ACCTGGAGCTGGCAGTCGGCAGCTGCAGTGATAGATGTCTGCAAGAAAGATCTGAAAAGAGGGAGGAAGATGAAGGACCC 
AGAGGACCACCGACCTCTGCTGCCTGACAAAGCTC4C.AGGACCAGTCTCTCCTACAGATGGGAGACAGAGGCGACAGATGA 
20 ATuGT'-AvjGGGAGGAGT cagagaaaggagagggtgaggcagagacc.-a.aggagggaaacacttgtgctctacagctactg 

ACTGAGTACCAGCTGCGTGGCAGACAGCCAATGCCAAGGCTCGGCTGATCATGGCACCTCGTGGGACTCCTAGCCCAGTG 
CTGGCAGAGGGGAGTGCTGA.ATGGTGCATGGTTTGCATATGATCTGAATGTGGTCCAGCCCTAGTTTCCTTCC-^GTTGCT 
GGGATAAAGCACCCTGACCAAAGCTACTTTTTTGTTTGTTTGTTTTGGTTTGGTTTTGTTTGGTTTTTCGAGGCAGGGTT 
TCTCTGTATCACCCTAGCTGTCCTGGAAC7CACTCTCTAGACCAGGCTGGCCTCG.AACTCAG.AAATCCCCCTGCCTCTGC 

25 CTCCT AAG TG CTG GAAT T AAAGGCCT GCGC CAC CACTGCCGGCC C AAAGCT ACTTTAAGAGAGAG AGAGGAAV G T AT AAG 

TATTATAATTCCAGGTTATAGTTCATTGCTGTAGAATTGGAGTCTTCATATTCCAGGTAATCTCCCACAGACATGCCACA 

.AAACAACCT GTTCT ACGAAAT CT CTCAT GGACT CCCTTCCCCAGT AAT TCTAAACTGTGT CAAATCT ACAAG AAAT AGTG 

. ACAGTCACA.GTCTCTAACGTTTTGGGCATGAGTCTGAAGTCTCATTGCTAAGTACTGGG.AAGATGAAAACTTTACCTAGT 
GTCAGCATTTGGAGCAGAGCCTTTGGGATTTGAGAT3C-TCTTTTGCAGAGCTCCTAATGGCTACATGGAGAGAGGGGGCC 
30 TGGGAGAGACCCATACACCTTTTGCTGCCTTATGTCACCTGACCTG-rCCTTGGGAAGCTCTAGCAAGAAGGCCTTCCCT 

G GAT CACCCAC CACCTT GC ACCTC C AG AACT CAG AG CCAAATT AAACTT T CTT GT T ACT GTCGT C.AA AGO AC AG T CG G T C 
TGGGTTGT ATC ACT GTCAAT GGGAAACAG ACTT GCCTGG AT GG AT AACT T GT ACATT GCA7AAT G TCT AG AA.A7GAAA.AG 

TCCTATAGAGAAAAAGAAAATT.AGCTGGCACACAGATAGAG3CCCTGGAGGAGGCTGGCTTTGTCCTCCCCGAGGAGGTG 

GCGAGT AAGGT GT AAATGTTCATGGATGT AAAT GGGCCCATAT AT GAGGG7CTGGGGT AACAAG AAGGCCTGTGAAT AT A 
AAGC ACTGAmAG GT AT GTCT AGT CTG GAG AAG GT C ACT AC AG AG AG TTCTCCAACT CAGTGCC CAT ACACACAC.AC ACACA 

CACACACACACACACACACACACACACACACCACAAAGAAAAAAAG3AAGAAAAATCTGAGAGCAAGTACAGTACTTAAA 
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AT'rGTG'rttW'TSTGTGTGTGACTCTGATGTCACa.TGCTCATCTTGCCCTATGAGTTGLfiAaACCAAATGGCCCCTGAGAGw 

CAT AACAACC AC\AC TGT7G GCT 3TG TGCT CAC3TT T TT CTT AAAGCGTCT GTCTt-iWT'l TGCTGC T AGCA.T CAG GC A.GACT 
T GCAGCAG ACTA. CAT AT GCT C ASC OCT G AAGTCCT T CT AGG GTG C AT GTCTCTT CAGAAT TTCAGAAAGT CAT CTG7GGC 
TCCAGCACCGCCTGCACTCTCCCTCTGCCGCGAGGCTGCAGACTCTAGGCTGGGGTGGAAGCAACGCTTACCTCTGGGAC 
5 AAGTATAAC.A7GT7GGC7TTTCTT7CCC7CTGTGGCTCC.AACCTGGA.CATAAAAT.AGATGCAAGCTGTG7AATA.AATA.T7 

tcctcccgtccacttacttctcaac.aataactactctgagagcacttattaataggtggcttagacat^agc:tttggctc 

attcccccactagctcttacttcttt.aactctttc.aaaccattctgtgtcttccacatggttagttacctctccttccat 

cctggttcgcttcttccttcgagtcgccctcagtgtctctaggtgatgcttgtaagatattctttctacaaagctg.agag 

tggtggcactctgggagttca.aagccagcctgatctacacagc.aagctccagcatatccagggcaatgttgggaaaacct 

10 T T CT C AAAC.-.--AAAG AGGGGT T CAGT T GTC.AGGAGG.AGACCC.ATGGGTTAAGAA.GTCTA.GA.CGAGCCATGGTGATGCAT A 

cctttcatccaagcacttaggaggc.aaacaaaggtgaaactctttgactttgaggccagctaccttacatagtgataccc 

TGCTT AGT 3T G TGTGT GTGTG7 GTGT CTGTGTGTGTGTGTGTGTGT AATT T AAAAG TCT.AA.W.TGC AT TCTTTT.-AAAA 

tatgtat.aagtatttgcctgcacatatgtatgtatgtatgtataccatgtgtgtgtctggtgctgaaggactaggcatag 

ACTCCCTAGAACTAGAGTCATAGACAGTTGTGACACTCCCCAACCCCCCACCATGTG.GGTGCTTG.AAGCTA-AACTCCTGT 
1 5 CCTTTGTAAA.GCAGCAGGTGTCTA7GAA.CCC?GAACCA7C7(;TCCAGTC7CCA3ATGTGCATTCTCAAAGAGGAG7CCTT 
C.ATATTTCOCTAAACTG.AACATCCT7ATCAG7GAGCATCCTCGAGTCACC.AAA3CTACTGC.AAACCCTCTTA.GGGAA.CAT 
TCACTATTCACTTCTACTTGGCTCATGAAACTTAA.GTACACACACACAAA.CACACACACACACACAGAGTCATGCACTCA 
CAAAAGCATGCATGTACACCATTCTTATTAGACTATGCTTTGCTAAAAGACTTTCCTAGATACTTTAAAACATCACTTCT 
GCCTTTTGGTGGGCAGGTTCCAAGATTGGTACTGGCGTACTGGA-AACTGAACAAGG’i'AGLAG.ATCTAGAAATC.ACAGCAGG 
20 TCAGAAGGGCCAGOCTGTACAA.GAGAGAGTTCCACACCTTCCAGGAACACTGAGCAGGGGGCTGGGACCTTGCCTCTCAG 
CCCAAGAAA.CTA.GTGCGTTTCCTGTA.ToCATGCCTCTCA.GAGAJTCCA.TAAGA.TCTGCCTTCTGCCA.TAAG.ATCTCCTGC 
ATCCAGACAAGCCTAGGGGAAGTTGAGAGGCTGCCTGAGTCTC7CCCACAGGCCCCTTCTTGCCTGGCAGTAT7TTTTTA 

TCT GGAGG AGAG G AA.T C AG 3GT G GG.AAT G AT CAAAT AC AA.T T ATCAA.GGAAAAA.GT AAAA.A.AC AT AT AT AT ATA.T AT AT 1 T 

AACTGATCTAGGG^GCTGGCTCAGCAGTTAAGAGTTCTGGCTGCCCTTGCTTCAGATCTTGCTTTGATTCCCAGCACCCA 

25 CATGA.TGGCTTT CAACT GT A.T CT CT GCT T CCAGG3GAT CCAACA.GCCTCT T CTG ACCTCC AT AG AC AAG AC CT AGT C CTC 

TGCAA*GAGCACCAAATGCTCTTATCTGTTGATCCATCTCTCTAGCCTCATGCCAGATCATTTAA n ACTACTGGACACT3T 

CC CAT T T T ACGAAGAT GT C ACT GCCC AGT CAT 7T 3CC AT GAG T GGATAT TTCGATTCTTTCTATGTTCTCACCCT T GC AA 

TTTATAAGAAAGATATCTGCATTTGTCTCCTGAGAGAA.CAAAGGGTGGAGGGCTACTGAGATGCCTCTAGGGGTAAA.G3T 

GCT T GCC ACAAAA.T CT GACAACT T AAG TTTGGTCTTGG AAT CCACA.T GGTGG AG AG AGAGAA.G AG AT T CCCG'i* AAG T T GT 
3 0 CCT CAAACTTCCCACACATGT GCT GTGGCTTATGTGT AA.CC CC AATAAGT AAAGA.T AGTTT TAAACA.CTACA.TAA.GGT AG 

GGT T TCT T C AT G AC CCCAAGGAAT G AT GC CCC T G A.T A.G AGCT T A.TG CT GAAA.CC CC AT CT C C AT T GT G CC AT CT 3G .AAAG 
AGACAATT GCATCCCGG AAACA.GAATCTT CAT G AATGG ATTAATGAGCT AT T AAG AAAGTGG CTTGG T TAT TG CAC AT GC 
TGGCGGCGT AAT G ACCT CC ACC AT GAT GT TAT C CAG CAT GAAGGTCCT CACC.AG AAGTC AT ACAAAT CT TCTT A3G CTTC 

CAGAGTCGTGAC-CAAAAAAAGCACACCTCTAAATAAATTAACTAGCCTCAGGTAGTT.AA.CCACCGAAAATGAA.CC.AAGGC 
35 AGTTCTAATA.CAAAACCACTTCCCTTCCCTGTTCAAA.CCACAGTGCCCTATTATCT.AAAAGATAAACTTCAAGCCAAGCT 
TTTAGGTTGCCAGTATTTATGTAACAACAAGGCCCGTTGACACACATCTGTAACTCCTAGTACTGGGCCTCAGG.3GCA.GA 
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GACAGGTGaa5CCCTGGAGTTTGAaT7CCAGGTTC73T5AGaA0.CTCTGTC75PAAAGACaA7ATGGTGAGTCxACC0'3G-? 
AGGATATCT3ATATTGACTTCTG3CCAACACACA3CCATCTCTGCACATC7GTAC-T7GCAAGCCT7TTGCACT.AAG7TTG 
GCCAGAGTCAGAGTTTGCAAGTGTT-TGTGGACTGAATGCACGTGTTGCTGGTGATCTACAAAGTCACCCTCCTTCTCAAG 
CT AGCAGCAC7 G GCTT CGGCCAGC7GCTCATT CAAG CCTCTT TGC AGAGTCATCACGGGG ATGGGGG AGC AG5GCCC CTC 
5 (:CTAGAACACC.AAGCCTGTGGTTGTTTATT(‘:AGGACAT'l\AT , l , GAGGC^CCAA.GATGACAGATAACTCTATCACTTGGCC.=A 
CAGTCGCGTG7TCCC.GTGTTAGC.TTATTTCTGTGTCT3CAGAAAACAGTGCAACCTGGACAAAAGAAA7.AAATGA7ATCA 
TTTTTCATTCAGGCAACTAGATTCCGTGGTACAAAACwirrrcccrrUGGGP-ACGASGCCGGGACAGCGCGGCTCCTGAGTCG 
CTATTTCCGTCTGTCAACTTCTCTAATCTCTTGATTTCCrrCCCTCTGTCTGTTTCCTTCCTCTTGCTGGGGCCCAGTGGA 
GTCTGTGTACTCACAGGGAGGAGGGTGGCAAAGCnCTGGTCCTCTACGGGCTGGGGGAAGGGGGGAAGCTGTCGGCCCAG 
10 TGACTTT TT CCCCTTTCTCT TTT TCT T AGAAACCAG7 CTCAATTT AA6A7 AAT3AGT CTCCT CAT? CACGTGTc C? CAC7 

ATTCATA5GQACfTATCCACCCCCGCCCTGTCAATCT5GCTAAGTA.0GACA.AGTCAAATTTAAAAGGGAACGTTTTTCTA 
/••AAATGT GGCT GGACCGTGTGCCGGC.ACGAAACCAG 33A.TCGCGG T CT.AAG7 7ACA.T GCT CT CTGCCAGCCCCGGTGCC 7 
77TCCTTTCGGAAA3GAGACCCGGA3GTAAAACGAAG7TGCCAACTTTTAATGATGGTGTGCGCCGGGTGACTCTTTAA-. 
A.TGTCATCCATA.CCTGGGATAGGGAAGGCTCTTCAGGGAGTCATCTAGCCCTCCCTTCAGG.AAAAGATTCCACTTCCGGT 
15 TT.AGTTAGCTTCCACCTGGTCCCTTATCCGCTGTCTCTGCCC.ACTAGTCCTCATCCATCCGGTTTf.xrGCCCTCATCCACC 
TTGCCCTTTTAGTTCCTAGAAAGCAGCACCGTAG7CT7GGCAGGTGGGCCATTGGTCACTCCCCTA.CCAC7GTTACCATG 
GCCACCAAGGTGTCATTTAAATATGAGCTCACTGAGTCCTGCGGGATGGCTTGGTTGGTAATATGCTTGCTGCAAAATCG 
T G AG AA CT GGAGT T CAAT TCCC AGC AC AT GG AT GT AT T 7 CC AG C ACCT GG AAGGC AGGGAGCAG AG AT CT 7 AAAGCT CCT 
GGCCAGACAGCCCAGCCTAATTAGTAATCAGTGAGAGACCCTGTCTCAAGAAACAAGATGGAACATCAAAGGTCAACCTC 
20 TTGTCTCCACA.CACAC.AAAl-ACACACATGCACATACATCCACACACAGGCAAACACATGCACACACCTGAACACCCTCCA 
CAAATACATACATAAAAAAATAAATACATACACACATACA7ACATACACCAACATTCCCTCTCCT7A.G7CTCCTGGCTAC 
GCTCTTGTCACCCCCACTAAGGCTTCAACTTCTTCTA7TTG7TCATCTTGACTCCTCTGTACTTTGCA7GCCTTT7CCAG 
CAAAGG CT TT T CTTTAAATCT C CGTC ATTC AT AAAC 7 C CC 7 CT.AAATT TCTTCCCCTGCCCTTTTCTTTCTCTCTAG3GA 
GATAAAGACA.CACACTAC.AAAGTCACCGTGGGACCAGTTTATTCACCCACCCACCCCTGCTTCTGTTCATCCGGCCAGCT 
25 AAGTAGTCCAACCTCTCTGGTGCTGTACCCTGGACCCTGGCTTCACCACAGCTCCTCCATGCTACCCAGCCCTGCAAACC 
TTCAGCCTAGCCTCTGGTTCTCCAACCAGCACAGGCCCAGTCTGGCTTCTATGTCCTAGA-AATCTCCTTCATTCTCTCCA 
TTT CCCTCCT G AATCT ACCACCTTC7T TCTCCCTTCT CCT GACCTCTAAT GTCTT GGTC AAACGATT AC.AAGGAAGCCAA 
T GAAATTAGCAGT T7 GGGGT ACCT C AG AGT C AGCAGGGGA.GCT GGGAT GAAT T C ACATT T CCAGGC C TT T G CTT T GC ?CC 

CCGaATTCTGACAGGCAGTTCCGAAGCTGAGTCCACCAAGCTGAATTTAAAATCACACTCCAGCTGGGTTCTGAGGCAGC 
30 CCTACCACATCAGGTGGCCCTGACTGAGCTGTGTCTGGGTGGCAGTGGTGCTGGTGGTGCTGGTGGTGCTGGTGGTGGTG 
GTGGTGGTGGTGGTGGTGGTGGTGGTGGTGTGTGTCTCTGTTTTCTGCTTTTACAAAACTTTTCTAATTCTTATACAAAG 
GACAAATCTGCCTCATATAGGCAGA.AAGATGACTTATGCCTATAT.AAGATAT.AAAGATGACTTTATGCCACTTATTAGCA 
AT AGT T ACTGT C AAAAGT AAT? CT AT T T AT ACACCCT T AT AC ATG G T ATT GCTT T 7GTTGG AG AC TCT AAAATCC A.GAT T 
A7GTATTTAAAAAAAAATTCCCCAGTCCTTAAAAGGTGAAGPATGGACCCAGATAGAAGGTCACGGCACAAGTATGGAG7 
3 5 C GGAGT GT GGAGTCCTGCCAATGGTCTGGACAGAAGCATCCAGAGAGGGTCCAAGACAAATGCCTCGCCTCCTAAGGAAC 

ACTGGCAGCCC7GATGAGGTACCAGAGATTGCTAAGTGGAGGAATACAGGATCAGACCCATGGAGGGGCTTAAAGCGTGA 
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CTGTAGCAC4CCC?CCGCTGAGGGGCTCCAGGTGG3CGCCCA. a *GGrGCTGCAGTGGGAGCCACATGAGAGGTGATGTC:TTG 

GAG T C AC CT CGGG T AC C AT T GTT T AGG G AG» jTG G GG AT T TGT G G T GTGGAG AC AGG C AG CC T C AAGG ATGCTT T T C AACA 

AT3GTTGATGAG7TGGAACTAAAACAGGGGCCATCACACTGGCTCCCATAGC7CTGGGCTTGCCAGCTTCCACATC7GCC 

CCC C ACC CCCTGTCT G GC ACCAG CT C.AAGCTC T G TG AT T CTAC ACAT CCPAAAGAG G AAGAGT AGCCT ACT GGGC A? G CC 

5 ACCTCTTCTGGACCATCAGGTGAGAGTGTC.CCAAGCCCTAGGCTCCTGTCCAGGATGCAGGGCTGCCAGATAGGATGCTC 
AGCTATCTCCTGAGCTGG. n ACTATTTTAGGA n .TAAGGATTATGCCCGCCCGGGGTTGGCCAGCACCCCAGC.AGCCTGTGC 
TTGCGTAAAAGCAAGTGCTGTTGATTTATCTAAAAACAGAuCCGTGGACCCA.CCCA.CAGGACPA.GTATGTATGC.ATCTGT 
TT C ATGTA7 CT G AAAAGC GAC ACAACC ATTTTT C AC ATC ATGGCATCTTCCTAACCCCC ATTCTTTT TTGTTT TGT TTTT 
TTGAGACAGGGT TTCT CTGTGT AGTCC TGGCTCTCCTGG AACTCACTT TGTAGACCAGGCTGGCCTCGAACTC AG A £ ATC 
10 CTGGGATTAAAGGTGTGTGCCACCACGCCCGGCCCTPACCCCCATTCTTPATGGTGATCCASTGGTTGAA.ATTTCGGGCC 
ACACACATGTCCATTAGGGATTAGCTGCTGTCTTCTGAGCTACCTGGTACAATCTTTATCCCCTGGGGCCTGGGCTCCTG 
A.TCCC7GACT CG GGCCCG ATCAAGT CCAGT TCCT GGGCCCGATCAAG TC:CAGTT CCTGGGCCCGAAC AAGTCC AGTCCCT 
AGCTCGAT7A3CTCATCCTGGCTCCCTGGCCTGTTCTTACTTACACTCTTCCCCT7GCTCTGGACTTG7TGCTTTCTTTA 
CTCAAGTTGTCTGCCACA.GTCCCTA. 5 .GCCACCTCT3TA 0 .GACA a .CTA. t .GATAATAcr , J’CCCTCiAAGCACGG.AA.AGTCC7G 
15 AGTCACCACACCCTCTGGAGGTGTGTGGACACATGTTCATGCGTGTGGTTGCGCTTACGTACGTGTGC 

Sequence ID No. 18: Human Beer Genomic Sequence (This gene has two exons, at 
positions 161-427 abd 3186-5219) : 

20 





tagaggagaa 


gtctttgggg 


agggtttgct 


ctgagcacac 


ccctttccct 


ccctccgggg 


60 




ctgagggaaa 


catgggacca 


gccctgcccc 


agcctgtcct 


cattggctgg 


catgaagcag 


120 


25 


; agaggggctt 


taaaaaggcg 


accgtgtctc 


ggctggagac 


cagagcctgt 


gctactggaa 


180 




ggtggcgtgc 


cctcctctgg 


ctggtaccat 


gcagctccca 


ctggccctgt 


gtctcgtctg 


240 




cctgctggta 


cacacagcct 


cccgtgtagt 


gg^gggccag 


gggtggcagg 


cgttcaagaa 


300 


30 


















tgatgccacg 


gaaatcatcc 


ccgagctcgg 


agagtacccc 


gagcctccac 


cggagctgga 


360 




gaacaacaag 


accatgaacc 


gggcggagaa 


cggagggcgg 


cccccccacc 


acccctttga 


420 


35 


gaccaaaggt 


atggggtgga 


ggagagaatt 


cttagtaaaa 


gatcctgggg 


aggttttaga 


480 
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aacttctctt tgggaggctt ggaacactgg 

ccagcactgg tcgaggaaca gtcttgcctg 

5 gccttcaaat tcaggtgcag aggcatgagg 

gaggacgctg gggtggtgag ggtatggcat 

aaaagaaaag gtttcaaaga atctcctcct 

10 

caccactggg aagggaacaa ggtaagggag 

caccggacac tctatgctgg tggtggctgt 

15 cccaggaggt gaacccccag ctcgaagggg 

ggtagtgaga agagctgagg tgtgaacctg 

tgcggggggg tgtgggggac agatccccac 

20 

cccctgcagt gtgcattgcc cacggcctgc 

ttttcggcac agttttagcc cctgacatgg 

25 agcagcatcc tctgtgcagg agtagggaca 

cttgcctcat tccaggggag ggagaaggaa 

acagagaagc ccaggcgaca gtgtgcatct 

30 

atgggggcgt ctgaaatgac act tcagact 

aggtggcaga gaagtccact gcccaggctc 

35 tgttgggact atggggtgct aaaaagggca 
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ggtagaccca gtyaagattg ctggcccccg 540 
gaggtggggg aagaatggct cgctggtgca 600 
caacagacgc tggtgagagc ccagggcagg 660 
cagggcatca gaacaggctc aggggctcag 720 
gggaatatag gagccacgtc cagctgctgg 780 
cctcccatcc acagaacagc acctgtgggg 840 
ccccaccaca cagacccaca tcatggaatc 900 
aagaaacagg ttccaggcac tcagtaactt 960 
gtttgatcca actgcaagat agccctggtg 1020 
aaagcagtgg ggaggaaggc cagagaggca 1080 
ccagggagct ggcacttgaa ggaatgggag 1140 
gtgcagctga gtccaggccc tggaggggag 1200 
tctgtcctca gcagccaccc cagtcccaac 1260 
gaggaaccct gggttcctgg tcaggcctgc 1320 
ggctctataa ttggcaggaa tcctgaggcc 1380 
aagagcttcc ctgtcctctg gccat“atcc 1440 
ctggacccca gccctccccg cctcacaacc 1500 
actgcatggg aggccagcca ggaccctccg 1560 
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tcttcaaaat ggaggacaag ggcgcctccc 
ngggctccag cgactgcctg aagggctgta 
5 tggactccca cgagaggcca cagcccctga 
cctgcagagg aagcgcctgg cctaggggcg 
tccctgggca aatgcccccc tgaccacaca 

10 

aaaccagccc acagaccaga aagcagcccc 
tgtgcttgct cttcagagtg ggggtggggg 
15 tcttaagact atttttcatt cttccttgtc 
gggtggactg gtactcacac gacgaccagc 
accataggag acatggtcaa ggtgtgtgca 

20 

tctgcctgcc cagggagtat caccatgagg 

gtgcccagca gagagccagg tcaatgtttg 

25 agctcagggc ccctatggta ggaaagtaac 

cccagcccag ccccccatgg atgctcgaac 

ggtgctggga ccccagggaa gtggagtccg 
30 

ttttctctgg ctgggcctca gtattctcat 
tgattccttt caagtctaat gaattcctgt 
35 ccacagcagc ;tgccctgatt tattaccttc 
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cccacagctc cccttctagg caaggtcagc 1620 
aggaacccaa acacaaaatg tccaccttgc 1680 
ggaagccaca tgctcaaaac aaagtcatga 1740 
ctattctcga aaagccgcaa aatgccccct 1800 
cacattccag ccctgcagag gtgaggatgc 1860 
agacgatggc agtggccaca tctcccctgc 1920 
gtggccttct ctgtcccctc tctggattgg 1980 
acattggaac tatccccatg aaaccttLgg 2040 
tatttaaaaa gctcccaccc atctaagtcc 2100 
ggggatcagg ccaggcctcg gagcccaatc 2160 
cgcccactca gataacacag aacaagaaat 2220 
Cggcagctga acctgtaggt tttgggtcag 2280 
gacagtaaaa agcagccctc agctccatcc 2340 
gcagagcctc cactcttgcc ggagccaaaa 2400 
gagatgcagc ccagcctttt gggcaagttc 2460 
tgataatgag ggggttggac acactgcctt 2520 
cctgatcacc tcccctccag tccctcgcct 2580 
aattaacctc tactcctttc tccatcccct 2640 
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gtccacccct cccaagtggc cggaaaagga atttgggaga agccagagcc aggcagaagg 2700 

tgtgctgagt acttaccctg cccaggccag ggaccctgcg gcacaagtgt ggcttaaatc 2760 

5 ataagaagac cccagaagag aaatgataat aataatacat aacagccgac gctttcagct 2820 

atatgtgcca aatggtattt tctgcattgc gtgtgtaatg gattaactcg caatgcttgg 2880 

ggcggcccat tttgcagaca ggaagaagag agaggttaag gaacttgccc aagatgacac 2940 

10 

ctgcagtgag cgatggagcc ctggtgtttg aaccccagca gtcatttggc tccgagggga 3000 

cagggtgcgc aggagagctt tzccaccagct ctagagcatc tgggaccttc ctgcaataga 3060 

15 tgttcagggg caaaa'gcctc tggagacagg cctggcaaaa gcagggctgg ggtggagaga 3120 

gacgggccgg tccagggcag gggtggccag gcgggcggcc accctcacgc gcgcctctct 3180 

ccacagacgt gtccgagtac agctgccgcg agctgcactt cacccgctac gtgaccgatg 3240 

20 

ggccgtgccg cagcgccaag ccggtcaccg agctggtgtg ctccggccag tgcggcccgg 3300 

cgcgcctgct gcccaacgcc atcggccgcg gcaagtggtg gcgacctagt gggcccgact 3360 

25 tccgctgcat ccccgaccgc taccgcgcgc agcgcgtgca gctgctgtgt ccc ggtggtg 3420 

aggcgccgcg cgcgcgcaag gtgcgcctgg tggcctcgtg caagtgcaag cgccccaccc 3480 

gcttccacaa ccagtcggag ctcaaggact tcgggaccga ggccgctcgg ccgcagaagg 3540 

30 

gccggaagcc gcggccccgc gcccggagcg ccaaagccaa ccaggcc'gag ctggagaacg 3600 

cctactagag cccgcccgcg cccctcccca ccggcgggcg ccccggccct gaacccgcgc 3660 

35 cccacatttc tgtcctctgc gcgtggtttg attgtttiata tttcattgta aatgcctgca 3720 
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acccagggca gggggctgag accttccagg 
cccctcagcc cgccagctga ggggtcccac 
5 cactgagcca cgcagccccg cctctggggc 
aggcagaaat ggaagcattt tcaccgcc ct 
aaagtccagg gactggttaa gaaagttgga 

10 

cagaaagcct gaggcgtgcc cagagcacaa 
gtcctggctc tgccactaac ttgctgtgta 
15 ctcaatttcc actttgtaaa atgagggtgg 
ggcatatgat tccaaggact ccagtgcctt 
gaaagagaga gaatgaatgc agttgcattg 

20 

gagttgtgat gctctcttct gacagccaaa 

agagtctatt tatggctgac atatttacgg 

25 .-ttcccagcct ggcttccccg gatgtttggc 

catcatccat tggggtagaa aaggagaggg 

atccgcccca acttcccaaa gagcagcatc 
30 

tcaccttccg aagagaagtg aaaggttcaa 
gccatcacaa actcacagac cagcacatcc 
35 cacggacaca tttctgccta gaaaacagct 
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ccctgaggaa tcccgggcgc cggcaaggcc 3780 
ggggcagggg agggaattga gagtcacaga 3840 
cgcctacctt tgctggtccc acttcagagg 3900 
ggggttttaa gggagcggtg tgggagtggg 3960 
taagatt ccc ccttgcacct cgcugcccat 4020 
gactgggggc aactgtagat gtggttccta 4080 
accttgaact acacaattct ccttcgggac 4140 
a g 9 tgggaat aggatctcga ggagacnatt 4200 
ttgaatgggc agaggtgaga gagagagaga 4260 
attcagtgcc aaggtcactt ccagaactca 4320 
gatgaaaaac aaacagaaaa aaaaaagtaa 4380 
ctgacaaact cctggaagaa gctatgctgc 4440 
tacctccacc cctccatctc aaagaaataa 4500 
tccgagggtg gtgggaggga tagaaatcac 4560 
cctcccccga cccatagcca tgttttaaag 4620 
ggacactggc cttgcaggcc cgagggagca 4680 
cttttgagac accgccttct gcccaccact 4740 
tcttactgct cttacatrgtg atggcatatc 4800 
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ttacactaaa agaatattat tgggggaaaa 
tgcagagcat aatagctgcc acccaaaaat 
5 tactttctgt gtagttttta attgttaaaa 
tatgaaagcc tgcaggactg gtcgtttttt 
aagaatgaaa gtagtggttt ttaaagagtt 

10 

atttatgcaa aagttcttct tgtagagaat 
agtctgt net tccagagtcc agagacattg 
15 gatgtggtct cattttgtca accacacatg 
tcttggtcac tagagcccca acctcggaca 
ggcaattatg tcttcctttg aaaagtcatg 

20 

atttcttcac ccagggcatg gtaataacct 

ccatgctggc ttccaaaatg ctgttctcat 

25 tagctccccc ttgcccagga tcaagtgcag 

gcttgactcc cacctcccac tccaccaagc 

ccctgcttcc ttagtttgcc atccacactt 
30 

aggattcaca ttacttgtca tctctt cccc 
gatccctctc tcctctgagg ttccagcccc 
35 ttctgttttc catttgacag tcattcatgg 
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actacaagtg ctgtacatat gctgagaaac 4860 
ctttttgaaa atcatttcca gacaacctct 4920 
aaaaaaagtt ttaaacagaa gcacangaca 4980 
tggcaattct tccacgtggg acttgtccac 5040 
aagttacata tttattttct cacttaagtt 5100 
gacaatgtta atattgcttt atgaattaac 5160 
ttaataaaga caatgaatca tgaccgaaag 5220 
acgtcatttc tgtcaaagtt gacacccttc 5280 
cacctttgac tgctctctgg tggcccttgt 5340 
tttatccctt cctttccaaa cccagaccgc 5400 
cagccttgta tccttttagc agcctcccct 5460 
tgtatcactc ccctgctcaa aagccttcca 5520 
tttccctatc tgadatggga ggccttctct 5580 
ttcctactga ctccaaatgg tcatgcagat 5640 
agcaccccca ataactaatc ctctttcttt 5700 
taaccttcca gagatgttcc aatctcccat 5760 
ttttgtctac accactactt tggtccctaa 5820 
aggaccagcc tggccaagtc ctgcttagta 5880 
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ctggcataga caacacaaag ccaagtacaa 
tcttcgaagt gtggatttga tgcctcctgg 
5 agcctcctgc acttctctca aagtctcgcc 
tagctgagtt agcatctcca gatgaagagt 
ggtgggatgg tcaactggaa agccttaaat 

10 

cccacaaaga aagctgaggt ccctcctgat 
ttgaattaat catcatcatc aagtctttga 
15 cttcctgggg ctccagcccc cgaggaggcc 
catgagtgtc tcagggccca gaatatgaga 
ctgagaggca gtgcggtcca gtgggtgggg 

20 

tgtttgtttg ttttgagaca gagtctcgct 

atctcggctt actgcaactt ctgccttccc 

25 cagagtagct gggattacag gtgcgtgcca 

agagacgggg tttcaccatg ttggccaggc 

gcctgcctcg gcctcccaaa gtgctgggat 
30 

ggttggtgtt tgaatctgag gagactgaag 
catacttggg ctcagttcct tgccctaccc 
35 acatgggcaa taaccaggtc acactgtttt 
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ttcaggacca gctcacagga aac^tcatct 5940 
gtagaaatgt aggatcttca aaagtgggcc 6000 
cccccaaggt gtcttaatag tgctggatgc 6060 
aaccctaaag ttactcttca gttgccctaa 6120 
taagtccagc ctaccttggg ggaacccacc 6180 
gacttgtcag tttaactacc aataacccac 6240 
taggtgtgag tgggtatcag tggccggtcc 6300 
tcagtgagcc cctgcagaaa atccatgcat 6360 
gcaggtagaa aacagagaca tcttccatcc 6420 
acacgggctc tgggtcaggt ctgtgttgtt 6480 
ctattgccca ggctggagtg cagtgtcaca 6540 
ggattcaagt gattctcctg cctcagcctc 6600 
ccacgcctgg ctaatttttg tatttttgat 6660 
tagtctcgaa ctcttgacct caagtgatct 6720 
tacaggcgtg agccaccaca cccagcccca 6780 
caccaagggg ttaaatgttt tgcccacagc 6840 
ctcacttgag ctgcttagaa cctggtgggc 6900 
gtaccaagtg ttatgggaat ccaagatagg 6960 
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agtaatttgc tctgtggagg ggatgaggga 
tgttgcttag agattttcca ggtggagaag 
5 caaagactgc aagtgcatgg ctgctcatgg 
taccgagtcc ttgccatgtg caaggcaaca 
aaacctaggg tctgctttct gggacctgaa 

10 

cccatcacca cgagggggaa aaaaacctga 
tttatggaaa gttatattct acctacatgg 
15 gaacaaacaa cagacctagc tgggaggggc 
ttctgggggt acagccagac tcagggcttg 
gggatagaag gaaataggtc cctttctctc 

20 

tctctcacac acacacacag acacacacac 

gtacaaatca ggccacattt acacaaggag 

25 ggaccccaaa attccctgtt ttccttgaat 

agagcctgca gaagccacct gcgagtaagc 

gtgccatgtc ccacacccgc ccacctccca 
30 

aacccggctc ccagatcatt gattatagct 
ccccacccca ttctcggaac ctgccctctg 
35 ctcagctatt gggcagcttt gaccaacagc 



94 

tagtggttag ggaaagcttc acaaagtggg 7020 
ggggcttcta ggcagaaggc atagcccaag 7080 
gtagaagaga atccaccatt cctcaacatg 7140 
tgggggtacc aggaatt cca agcaatgtcc 7200 
gatacaggat ggatcagccc aggctgcaat 7260 
aggctaaatt gtaggtcggg tcagaggtta 7320 
ggtctataag cctggcgcca atcagaaaag 7380 
agcattttct tgtagggggc ggggcacatg 7440 
tattaatagt ctgagagtaa gacagacaga 7500 
tctctctctc tctctctctc actctctctc 7560 
acgctctgta ggggtctact tatgctccaa 7620 
gtaaagaaaa agaacgttgg aggagccaca 7680 
caggcaggac ttacgcagct gggagggtgg 7740 
caagttcaga gtcacagaca ccaaaagctg 7800 
cctgctcctt gacacagccc tgtgctccac 7860 
ctggggcctg caccgtcctt cctgccacat 7920 
tcttctccct tgtccaaggg caggcaaggg 7980 
tgaggctcct tttgtggctg gagatgcagg 8040 
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aggcagggga atattcctct tagccaatgc gaccatgtgc ctggtttgcc cagggtggtc 9100 

tcgtttacac ctgtaggcca agcgtaatta ttaacagctc ccacttctac tctaaaaaat 8160 

5 gacccaatct gggcagtaaa ttatatggtg cccatgctat taagagctgc aacttgctgg 8220 

gcgtggtggc tcacacctgt. aatcccagta ctttgggacg tcaaggcggg tggatcacct 8280 

gaggtcacga gttagagact ggcctggcca gcatggcaaa accccatctt tactaaaaat 8340 

10 

acaaaaatta gcaaggcatg gtggcatgca cctgtaatcc caggtactcg ggaggctgag 8400 

acaggagaat ggcttgaacc caggaggcag aggttgcagt gagccaagat tgtgccactg 8460 

15 ccctccagcc ctggcaacag agcaagactt catzctcaaaa gaaaaaggat actgtcaatc 8520 

actgcaggaa gaacccaggt aatgaatgag gagaagagag gggctgagtc accacagtgg 8580 

cagcaccgac tcctgcagga aaggcgagac actgggtcat gggtactgaa gggtgccctg 8640 

20 

aatgacgttc tgctttagag accgaacctg agccctgaaa gtgcatgcct gttcatgggt 8700 

gagagactaa attcatcatt ccttggcagg tactgaatcc tttcttacgg ctgccctcca 8760 

25 atgcccaatt tccctacaat tgtctggggt gcctaagctt ctgcccacca agagggccag 8820 

agcrtggcagc gagcagctgc aggtaggaga gataggtacc cataagggag gtgggaaaga 8880 

gagatggaag gagaggggtg cagagcacac acctcccctg cctgacaact tcctgagggc 8940 

30 

tggtcatgcc agcagattta aggcggaggc aggggagatg gggcgggaga ggaactgaaa 9000 

aaggagaggg tggggatgga gaggaagaga gggtgatcat tcattcattc cattcctact 9060 

35 gactggatgc cagctgtgag ccaggcacca ccctagctct gggcatgtgg ttgtaatctt 9120 
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ggagcctcar ggagctcaca gggagtgctg gcaaggagat ggataataga cggataacaa 9180 

ataaacattt agtacaatgt ccgggaatgg aaagttctcg aaagaaaaat aaagctggtg 9240 

5 agcatataga cagccctgaa ggcggccagg ccaggcattt ctgaggaggt ggcatttgag 9300 

c 9301 



10 From the foregoing, it will be appreciated that, although specific 

embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 
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CLAIMS 

We claim: 

1. An isolated nucleic acid molecule selected from the group 

consisting of: 

5 (a) an isolated nucleic acid molecule comprising sequence ID Nos., 

1, 5, 9, 1 1, 13, or, 15, or complementary sequence thereof; 

(b) an isolated nucleic acid molecule that specifically hybridizes to 
the nucleic acid molecule of (a) under conditions of high stringency; and 

(c) an isolated nucleic acid that encodes a TGF-beta binding-protein 
10 according to (a) or (b). 

2. The isolated nucleic acid molecule according to claim 1 wherein 
said nucleic acid molecule encodes a protein comprising the protein of Sequence ID 
NO. 2. 

3. The isolated nucleic acid molecule according to claim 1 wherein 
15 said nucleic acid molecule encodes a protein comprising the protein of Sequence ID 

NO. 6. 

4. The isolated nucleic acid molecule according to claim 1 wherein 
• said nucleic acid molecule encodes a protein comprising the protein of Sequence ID 

NO. 10. 

20 5. The isolated nucleic acid molecule according to claim 1 wherein 

said nucleic acid molecule encodes a protein comprising the protein of Sequence ID 
NO. 12. 

6. The isolated nucleic acid molecule according to claim 1 wherein 
said nucleic acid molecule encodes a protein comprising the protein of Sequence ID 

25 NO. 14. 

7. The isolated nucleic acid molecule according to claim l wherein 
said nucleic acid molecule encodes a protein comprising the protein of Sequence ID 
NO. 16. 
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S. An expression vector, comprising a promoter operably linked to 
a nucleic acid molecule according to any one of claims 1 to 7. 

9. The expression vector according to claim 8 wherein said 
promoter is selected from the group consisting of CMV I-E promoter, SV40 early 

5 promoter and MuLV LTR. 

10. The expression vector according to claim 8 wherein said 
promoter is a tissue-specific promoter. 

11. A method of producing a TGF-beta binding protein, comprising, 
culturing a cell which contains a vector according to claim 8 under conditions and for a 

10 time sufficient to produce said protein. 

12. The method according to claim 11, further comprising the step of 
purifying said protein. 

13. A viral vector capable of directing the expression of a nucleic 
acid molecule according to any one of claims 1 to 7. 

I 5 14. The viral vector according to claim 13 wherein said vector is 

selected from the group consisting of herpes simplex viral vectors, adenoviral vectors, 
adenovirus-associated viral vectors and retroviral vectors. 

15. A host cell carrying a vector according to any one of claims 8 to 

14. 



16. The host cell according to claim 15 wherein said cell is selected 
from the group consisting of a human cell, dog cell, monkey cell, rat cell and mouse 
cell. 



17. An isolated protein, comprising a TGF-beta binding-protein 
encoded by the nucleic acid molecule according to any one of claims 1 to 7. 



25 



18. 



An antibody which specifically binds to the protein according to 



claim 17. 
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19. The antibody according to claim 18 wherein said antibody is a 
monoclonal antibody. 

20. The antibody according to claim 19 wherein said monoclonal 
antibody is a murine or human antibody. 

5 21. The antibody according to claim 18 wherein said antibody is 

selected from the group consisting of F(ab’)-; t F(ab),, Fab’, Fab, and Fv. 

22. A hybridoma which produces an antibody according to claim 19. 

23. A fusion protein, comprising a first polypeptide segment 
comprising a TGF-beta binding-protein encoded by the nucleic acid molecule 

10 according to any one of claims 1 to 7, or a portion thereof of at least 10 amino acids in 
length, and a second polypeptide segment comprising a non-TGF-beta binding-protein. 

24. The fusion protein according to claim 23 wherein said first 
polypeptide segment is at least 20 amino acids in length. 

25. The fusion protein according to claim 23 wherein said first 

15 polypeptide segment is at least 50 amino acids in length. 

26. The fusion protein according to claim 23 wherein said second 
polypeptide comprises multiple anionic amino acid residues. 

27. An isolated oligonucleotide which hybridizes to a nucleic acid 
molecule according to Sequence ID NOs. 1, 3, 5, 7, 9, 1 1, 13, or 15, or the complement 

20 thereto, under conditions of high stringency. 

28. The isolated oligonucleotide according to claim 27 wherein said 
oligonucleotide is at least 20 nucleotides in length. 

29. The isolated oligonucleotide according to claim 27 wherein said 
oligonucleotide is at least 30 nucleotides in length. 



25 



30. The isolated oligonucleotide according to claim 27 wherein said 
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oligonucleotide is at least 50 nucleotides in length. 

31. The isolated oligonucleotide according to claim 27 wherein said 
oligonucleotide is between 50 to 100 nucleotides in length. 

32. A pair of primers which specifically amplifies all or a portion of 
5 a nucleic acid molecule according to any one of claims 1 to 7. 

33. A ribozyme which cleaves RNA encoding a protein according to 

claim 17. 

34. The ribozyme according to claim 33 wherein said protein 
comprises the protein of Sequence ID NO. 2. 

10 35. The ribozyme according to claim 33 wherein said protein 

comprises the protein of Sequence LD NO. 6. 

36. The ribozyme according to claim 33 wherein said RNA encodes 
a protein comprising the protein of Sequence ID NO. 10. 

37. The ribozyme according to claim 33 wherein said RNA encodes 

15 a protein comprising the protein of Sequence ID NO. 12. 

38. The ribozyme according to claim 33 wherein said RNA encodes 
a protein comprising the protein of Sequence ID NO. 14. 

39. The ribozyme according to claim 33 wherein said RNA encodes 
a protein comprising the protein of Sequence ID NO. 16. 

20 40. The ribozyme according to claim 33 wherein said ribozyme is 

composed of ribonucleic acids. 

41. The ribozyme according to claim 40 wherein one or more of said 
ribonucleic acids are 2’-0-methyl ribonucleic acids. 

42. The ribozyme according to claim 33 wherein said ribozyme is 
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composed of a mixture of deoxyribonucleic acids and ribonucleic acids. 

43. The ribozyme according to claim 33 wherein said ribozyme is 
composed of nucleic acids having phosphothioate linkages. 

44. A nucleic acid molecule comprising a nucleic acid sequence 
5 which encodes a ribozyme according to claim 33. 

45. The nucleic acid molecule of claim 44, wherein the nucleic acid 
is DNA or cDNA. 

46. The nucleic acid molecule of claim 44, under the control of a 
promoter to transcribe the nucleic acid. 

10 47. A host cell comprising the ribozyme of claim 33. 

48. A vector, comprising the nucleic acid molecule of claim 44. 

49. The vector of claim 54, wherein the vector is a plasmid, a virus, 
retrotransposon or a cosmid. 

50. The vector of claim 49 wherein said virus is selected from the 
15 group consisting of retroviruses, adenoviruses, and adeno-associated viruses. 

51. A host cell containing the vector according to any one of claims 

48 to 50. 

52. The host cell according to claim 51 wherein said host cell is 
stably transformed with said vector. 

20 53. The host cell according to claim 51 wherein the host cell is a 

human cell. 

54. A method for producing a ribozyme, comprising providing DNA 
encoding the ribozyme according to claim 33 under the transcriptional control of a 
promoter, and transcribing the DNA to produce the ribozyme. 
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55. The method of claim 54 wherein the ribozyme is produced in 

vitro. 

56. The method of claim 54, further comprising purifying the 

ribozyme. 

5 57. A method for increasing bone mineralization, comprising 

introducing into a warm-blooded animal an effective amount of the ribozyme according 
to any one of claims 33 to 43. 

58. A method of increasing bone mineralization, comprising 
introducing into a patient an effective amount of the nucleic acid molecule of claim 44, 
10 under conditions favoring transcription of the nucleic acid molecule to produce a 
ribozyme. 



59. A pharmaceutical composition, comprising the ribozyme 
according to any one of claims 33 to 43, and a pharmaceutically acceptable carrier or 
diluent. 



15 60. A pair of primers capable of specifically amplifying all or a 

portion of a nucleic acid molecule according to any one claims 1 to 7. 

61. A method for detecting a nucleic acid molecule which encodes a 
TGF-beta binding protein, comprising incubating an oligonucleotide according to any 
one of claims 27 to 3 1 under conditions of high stringency, and detecting hybridization 

20 of said oligonucleotide. 

62. The method according to claim 61 wherein said oligonucleotide 

is labeled. 

63. The method according to claim 61 wherein said oligonucleotide 
is bound to a solid support. 

25 64. A method for detecting a TGF-beta binding protein, comprising 

incubating an antibody according to any one of claims 18 to 21 under conditions and 
for a time sufficient to permit said antibody to bind to a TGF-beta binding protein, and 
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detecting said binding. 

65. The method according to claim 64 wherein said antibody is 
bound to a solid support. 

66. The method according to claim 64 wherein said antibody is 

5 labeled. 

67. The method according to claim 66 wherein said antibody is 
labeled with a marker selected from the group consisting of enzymes, fluorescent 
proteins, and radioisotopes. 

68. A transgenic animal whose germ cells and somatic cells contain a 
10 nucleic acid molecule encoding a TGF-beta binding-protein according to claim 1 which 

is operably linked to a promoter effective for the expression of said gene, said gene 
being introduced into said animal, or an ancestor of said animal, at an embryonic stage, 
with the proviso that said animal is not a human. 

69. The transgenic animal according to claim 68 wherein TGF-beta 
15 binding-protein is expressed from a vector according to any one of claims 8 to 10. 

70. A transgenic knockout animal, comprising an animal whose 
germ cells and somatic cells comprise a disruption of at least one allele of an 
endogenous nucleic acid molecule which hybridizes to the nucleic acid molecule 
according to claim 1, wherein said disruption prevents transcription of messenger RNA 

20 from said allele as compared to an animal without said disruption, with the proviso that 
said animal is not a human. 

71. The transgenic animal according to claim 70 wherein said 
disruption is a nucleic acid deletion, substitution, or, insertion. 

72. The transgenic animal according to claim 68 or 70 wherein the 
25 animal is selected from the group consisting of a mouse, a rat and a dog. 
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73. A method for determining whether a candidate molecule is 
capable of increasing bone mineral content, comprising: 

(a) mixing one or more candidate molecules with TGF-beta-binding- 
protein encoded by the nucleic acid molecule according to any one of claims 1 to 7 and 

5 a selected member of the TGF-beta family of proteins; 

(b) determining whether the candidate molecule alters the signaling 
of the TGF-beta family member, or alters the binding of the TGF-beta binding-protein 

* to the TGF-beta family member. 

74. The method according to claim 73 wherein said member of the 

10 TGF-beta family of proteins is BMP6. 

75. A method for determining whether a candidate molecule is 
capable of increasing bone mineral content, comprising: determining whether a 
candidate molecule inhibits the binding of TGF-beta binding-protein to bone, or an 
analogue thereof. 

15 76. The method according to claim 75 wherein said analogue of bone 

is hydroxyapatite. 

77. A kit for detection of TGF-beta binding-protein gene expression, 
comprising a container that comprises a nucleic acid molecule, wherein said nucleic 
acid molecule is selected from the group consisting of (a) a nucleic acid molecule 

20 comprising the nucleotide sequence of SEQ ID NO; 1, 5, 7, 9, 11,. 13, or 15; (b) a 
nucleic acid molecule comprising the complement of the nucleotide sequence of (a); (c) 
a nucleic acid molecule that is a fragment of (a) or (b) of at least 20 nucleotides in 
length. 

78. A kit for detection of TGF-beta binding-protein, 

25 comprising a container that comprises an antibody according to any one of claims 18 to 

21 . 

79. An antisense oligonucleotide, comprising a 
nucleic acid molecule which hybridizes to a nucleic acid molecule according to 
Sequence ID NOs. 1, 3, 5, 7, 9, 1 1, 13, or 15, or the complement thereto, and wherein 

30 said oligonucleotide inhibits the expression of TGF-beta binding protein according to 
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claim 17. 

80. The oligonucleotide according to claim 79 

wherein said oligonucleotide is 15 nucleotides in length. 

81. The oligonucleotide according to claim 79 

5 wherein said oligonucleotide is 20 nucleotides in length. 

82. The oligonucleotide according to claim 79 

wherein said oligonucleotide is 50 nucleotides in length. 

83. The oligonucleotide according to claim 79, 

wherein said oligonucleotide is comprised of one or more nucleic acid analogs. 

10 84. The oligonucleotide according to claim 79, 

wherein said oligonucleotide is comprised of one or more ribonucleic acids. 

85. The oligonucleotide according to claim 79, 

wherein said oligonucleotide is comprised of one or more deoxyribonucleic acids. 

86. The oligonucleotide according to claim 79 

15 . wherein said oligonucleotide sequence comrpises one or more modified covalent 

linkages. 

87. The oligonucleotide according to claim 86 

wherein said modified covalent linkage is selected from the group consisting of a 
phosphorothioate linkage, a phosphotriester linkage, a methyl phosphonate linkage, a 

20 methylene(methylimino) linkage, a morphoiino linkage, an amide linkage, a polyamide 
linkage, a short chain alkyl intersugar linkage, a cycloalkyl intersugar linkage, a short 
chain heteroatomic intersugar linkage and a heterocyclic intersugar linkage. 
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human_cerberus . pro 
human_dan . pro 
human_beer . pro 



human_gremlin . pro 
human_cerberus . pro 
human_dan . pro 
human_beer . pro 



human_gremlin . pro 
human_cerberus . pro 
human_dan . pro 
human_beer . pro 



human_gremlin .pro 
human_cerberus . pro 
human_dan . pro 
human_beer . pro 



101 150 

AI.PPPDKAQ HNDSEQTQSP QQPGSRNRGR GQGRGTAMPG EEVLESSQEA 
SDSEPFPPGT QSLIQPID.G MKMEKSPLRE EAKKFWHHFM FRKTPASQGV 

MLRVLVGAVXj PAMLLAAPPP 

AFRWEGGGW QAFKNDATEI IPELGEYPEP PPELENNKTM NRAENGGRPP 



151 

LHVTERKYLK 

ILPIKSHEVH 

INKLALFPDK 

HHPFETKDVS 



RDWCKTQPLK 

WETCRTVPFS' 

SAWCEAKNIT 

EYSCRELHFT 



QTIHEEGCNS 

QTITHEGCEK 

QIVGHSGCEA 

RYVTDGPCRS 



RTIINRF . CY 
VWQNNL.CF 
KSIQNRA . CL 
AKPVTELVCS 



\J/ 200 

GQCNSFYIPR 
GKCGSVHFP . 
GQCFSYSVPN 
GQCGPARLLP 



201 

HIRKEEGSFQ 
. . GAAQHSHT 
TFPQSTESLV 
NAIGRGKWWR 



SCSF . . . CKP 
SCSH . . .CLP 
HCDS . . . CMP 
PSGPDFRCIP 



KKFTTMMVTL 

AKFTTMHLPL 

AQSMWEIVTL 

DRYRAQRVQL 



NCPELQPPTK 

NCTELSSVIK 

ECPGHEEVPR 

LCPGGEAPRA 



250 

K . KRVTRVKQ 
V. . . VMLVEE 

vd'klvekilh 

RKVRLVAS . . 



CRC . ISIDLD 
CQCKVKTEHE 
CSCQACGKEP 
CKCKRLTRFH 



300 



DQHILHAGSQ DSFIPGVSA- 

skeglsvyvq gedgpgsqpg thphphphph pggqtpeped 
NQSELKDFGT EAARPQKGRK PRPRARSAKA NQAELENAY- 



301 314 

human_greml in . pro 

human_cerberus . pro 

human_dan . pro PPGAPHTEEE GAED 
human_beer . pro 
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*Anti BMP-5 western 

Fig. 6 
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SEQUENCE LISTING 



<110> Brunkow, Mary E. 
Galas, David J. 
Kovacevich, Brian 
Mulligan, John T. 
Paeper, Bryan W. 
Van Ness, Jeffrey 
Winkler, David G. 



<12 0> COMPOSITIONS AND METHODS FOR INCREASING 
BONE MINERALIZATION 

<13 0> 240083 . 503 

< 14 0> US 

< 14 1 > IS 99 - 11-24 
<160> 41 

<170> FastSEQ for Windows Version 3.0 

< 2 1 0 > 1 
<211> 2301 
<2 12 > DNA 
<213> Homo sapien 

<400> 1 

agagcctgtg ctactggaag gtggcgtgcc ctcctctggc tggtaccatg cagctcccac 
tggccctgtg tctcgtctgc ctgctggtac acacagcctt ccgt'gtagtg gagggccagg. 
ggtggcaggc gttcaagaat gatgccacgg aaatcatccc cgagctcgga gagtaccccg 
agcctccacc ggagctggag aacaacaaga ccatgaaccg ggcggagaac ggagggcggc 
ctccccacca ccccttcgag accaaagacg tgtccgagta cagctgccgc gagctgcact 
tcacccgcta cgtgaccgat gggccgtgcc gcagcgccaa gccggccacc gagccggtgt 



60 

120 

180 

240 

300 



360 
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2 



gctccggcca 


gtgcggcccg 


gcgcgcctgc 


tgcccaacgc 


catcggccgc 


ggcaagtggt 


420 


ggcgacctag 


tgggcccgac 


ttccgctgca 


tccccgaccg 


ctaccgcgcg 


cagcgcgtgc 


480 


agccgctgtg 


tcccggtggt 


gaggcgccgc 


gcgcgcgcaa 


ggtgcgcctg 


gtggcctcgt 


540 


gcaagtgcaa 


gcgcctcacc 


cgcttccaca 


accagtcgga 


gctcaaggac 


ttcgggaccg 


600 


aggccgctcg 


gccgcagaag 


ggccggaagc 


cgcggccceg 


cgcccggagc 


gccaaagcca 


660 


accaggccga 


gctggagaac 


gcctactaga 


gcccgcccgc 


gcccctcccc 


accggcgggc 


720 


gccccggccc 


tgaacccgcg 


ccccacattt 


ctgtcctctg 


cgogt'.ggr.tt: 


gattgtttat: 


780 


atttcattgt 


aaatgcctgc 


aacccagggc 


agggggctga 


gaccttccag 


gccctgagga 


840 


atcccgggcg 


ccggcaaggc 


ccccctcagc 


ccgccagctg 


^ggggcccca 


cggggcaggg 


900 


gagggaattg 


agagtcacag 


acactgagcc 


acgcagcccc 


gcctctgggg 


ccgcctacct 


960 


ctgctggtcc 


cacr.tcagag 


gaggcagaaa 


tggaagcatt 


tticaccgccc 


t ggggtr.tr. a 


1020 


agggagcggt 


gtgggagtgg 


gaaagtccag 


ggactggtta 


agaaagttgg 


ataagattcc 


1080 


cccttgcacc 


tcgctgccca 


tcagaaagcc 


tgaggcgtgc 


ccagagcaca 


agactggggg 


1140 


caactgtaga 


tgtggtttct 


agtcctggct 


ctgccactaa 


cttgctgtgt 


aaccttgaac 


1200 


tacacaattc 


tccttcggga 


cctcaatutc 


cactttgtaa 


aatgagggtg 


gaggtgggaa 


1260 


taggatctcg 


aggagactat 


tggcacauga 


ctccaaggac 


tccagtgcct 


tctgaatggg 


1320 


cagaggtgag 


agagagagag 


agaaagagag 


agaatgaatg 


cagttgcatt 


gattcagtcc 


1380 


caaggtcact 


tccagaattc 


agagttgcga 


tgccctcttc 


tgacagccaa 


agatgaaaaa 


1440 


caaacagaaa 


aaaaaaagta 


aagagtctat 


ttatggctga 


catatttacg 


gctgacaaac 


1500 


tcctggaaga 


agctacgctg 


cttcccagcc 


tggcttcccc 


ggatgtttgg 


ctacctccac 


1560 


ccctccatct 


caaagaaata 


acatcatcca 


ttggggtaga 


aaaggagagg 


gtccgagggt 


1620 


ggcgggaggg 


atagaaatca 


catccccccc 


aacttcccaa 


agagcagcat 


ccctcccccg 


1680 


acccatagcc 


atgttttaaa 


gtcaccttcc 


gaagagaagt 


gaaaggttca 


aggacactgg 


1740 


ccttgcaggc 


ccgagggagc 


agccatcaca 


aactcacaga 


ccagcacatc 


ccttttgaga 


1800 


caccgccttc 


tgcccaccac 


tcacggacac 


atttctgcct 


agaaaacagc 


ttcttactgc 


1860 


tcttacatgt: 


gatggcatat 


cttacactaa 


aagaatatta 


ttgggggaaa 


aactacaa gt 


1920 


gctgtacata 


tgctgagaaa 


ctgcagagca 


taatagctgc 


cacccaaaaa 


tctttttgaa 


1980 


aatcatttcc 


agacaacctc 


ttactttctg 


. tgtagttttt 


aautgttaaa 


aaaaaaaagt 


204 0 


tttaaacaga 


agcacatgac 


atatgaaagc 


ctgcaggact 


ggccgttttt 


ttggcaactc 


2100 


ttccacgtgg 


gacttgtcca 


caagaatgaa 


agtagtggtt 


tttaaagagt 


taagtcacat 


2160 


atttattttc 


tcacttaagt 


tatttatgca 


aaagtttttc 


ttgtagagaa 


tgacaatgtt 


2220 


aatattgctt 


tatgaattaa 


cagtctgttc 


ttccagagtc 


cagagacaot 


gttaataaag 


2280 


acaatgaatc 


atgaccgaaa 


g 








2301 
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<212> PRT 
<213> Homo sapien 

<400> 2 

5 Met Gin Leu Pro Leu Ala Leu Cys Leu Val Cys Leu Leu Val His Thr 
15 10 15 

Ala Phe Arg Val Val Glu Gly Gin Gly Trp Gin Ala Phe Lys Asn Asp 
20 25 30 

Ala Thr Glu lie lie Pro Glu Leu Gly Glu Tyr Pro Glu Pro Pro Pro 

10 35 40 45 

Glu Leu Glu Asn Asn Lys Thr Met Asn Arg Ala Glu Asn Gly Gly Arg 

50 55 60 

Pro Pro His His Pro Phe Glu Thr Lys Asp val Ser Glu Tyr Ser Cys 
65 .70 75 80 

15 Arg Glu Leu His Phe Thr Arg Tyr Val Thr Asp Gly Pro Cys Arg Ser 

85 90 95 

Ala Lys Pro Val Thr Glu Leu Val Cys Ser Gly Gin Cys Gly Pro Ala 
100 105 110 

Arg Leu Leu Pro Asn Ala lie Gly Arg Gly Lys Trp Trp Arg Pro Ser 

20 115 120 125 

Gly Pro Asp Phe Arg Cys He Pro Asp Arg Tyr Arg Ala Gin Arg Val 

130 135 140 

Gin Leu Leu Cys Pro Gly Gly Glu Ala Pro Arg Ala Arg Lys Val Arg 
145 150 155 160 

25 Leu Val Ala Ser Cys Lys Cys Lys Arg Leu Thr Arg Phe His Asn Gin 

165 170 175 

Ser Glu Leu Lys Asp Phe Gly Thr Glu Ala Ala Arg Pro Gin Lys Gly 
180 185 190 

Arg Lys Pro Arg Pro Arg Ala Arg Ser Ala Lys Ala Asn Gin Ala Glu 

30 195 200 205 

Leu Glu Asn Ala Tyr 
210 

<2 10> 3 

35 <211> 2301 

<212> DMA 
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4 



<2X3 > Homo sapien 



<400> 3 



agagcctgtg 


ctactggaag 


gtggcgtgcc 


ctcctctggc 


tggtaccatg 


cagctcccac 


60 


tggccctgtg 


tctcgtctgc 


ctgccggtac 


acacagcctt 


ccgtgtagtg 


gagggctagg 


120 


ggcggcaggc 


gttcaagaat 


gatgccacgg 


aaatcatccc 


cgagctcgga 


gagtaccccg 


180 


agcccccacc 


ggagctggag 


aacaacaaga 


ccatgaaccg 


ggcggagaac 


ggagggcggc 


240 


ctccccacca 


cccctttgag 


accaaagacg 


tgtccgagta 


cagctgccgc 


gagctgcact 


300 


tcacccgcca 


cgcgaccgat 


gggccgtgcc 


gcagcgccaa 


gccggtcacc 


gagccggtgt 


360 


gctccggcca 


gtgcggcccg 


gcgcgcctgc 


tgcccaacgc 


catcggccgc 


ggcaagtggt 


420 


ggcgacctag 


tgggcccgac 


ttccgctgca 


tccccgaccg 


ctaccgcgcg 


cagcgcgtgc 


480 


agctgctgtg 


tcccggtggt 


gaggcgccgc 


gcgcgcgcaa 


ggtgcgcctg 


gtggcctcgt 


540 


gcaagtgcaa 


gcgcctcacc 


cgcttccaca 


accagtcgga 


gctcaaggac 


ttcgggaccg 


600 


aggccgctcg 


gccgcagaag 


ggccggaagc 


cgcggccccg 


cgcccggagc 


gccaaagcca 


660 


accaggccga 


gctggagaac 


gcctactaga 


gcccgcccgc 


gcccctcccc 


accggcgggc 


720 


gccccggccc 


tgaacccgcg 


ccccacattt: 


ctgtcctctg 


cgcgtggttt 


gattgtttat 


780 


atttcattgt 


aaatgcctgc 


aacccagggc 


agggggctga 


gaccttccag 


gccctgagga 


840 


atcccgggcg 


ccggcaaggc 


ccccctcagc 


ccgccagctg 


aggggtccca 


cggggcaggg 


900 


gagggaattg 


agagtcacag 


acactgagcc 


acgcagcccc 


gcctctgggg 


ccgcctacct 


960 


ttgctggccc 


cacttcagag 


gaggcagaaa 


tggaagcatt 


tccaccgccc 


nggggtctta 


1020 


agggagcggt 


gtgggagtgg 


gaaagtccag 


ggactggtta 


agaaagttgg 


ataagattcc 


1080 


cccccgcacc 


tcgctgccca 


tcagaaagcc 


tgaggcgtgc 


ccagagcaca 


agaccggggg 


1140 


caactgtaga 


tgtggtttct 


agtcctggct 


ctgccactaa 


cttgctgtgt 


aaccttgaac 


1200 


tacacaattc 


tccttcggga 


cctcaatttc 


cactttgtaa 


aatgagggtg 


gaggtgggaa 


1260 


taggatctcg 


aggagactat 


tggcatatga 


ttccaaggac 


tccagtgcct 


tttgaatggg 


1320 


cagaggtgag 


agagagagag 


agaaagagag 


agaatgaatg 


cagttgcatt 


gattcagtgc 


1380 


caaggtcact 


tccagaattc 


agagttgtga 


tgctctcttc 


tgacagccaa 


agatgaaaaa 


1440 


caaacagaaa 


aaaaaaagta 


aagagtctat 


ttatggccga 


catatttacg 


gctgacaaac 


1500 


tcctggaaga 


agctatgctg 


cttcccagcc 


tggcttcccc 


ggatgtttgg 


ctacctccac 


1560 


ccctccatct 


caaagaaata 


acatcatcca 


ttggggtaga 


aaaggagagg 


gtccgagggt 


1620 


ggtgggaggg 


atagaaatca 


catccgcccc 


aacttcccaa 


agagcagcat 


ccctcccccg 


1680 


acccacagcc 


atgctttaaa 


gtcaccttcc 


gaagagaagt 


gaaaggttca 


aggacaccgg 


1740 


ccttgcaggc 


ccgagggagc 


agccatcaca 


aactcacaga 


ccagcacatc 


ccttttgaga 


1800 


caccgccttc 


trgcccaccac 


tcacggacac 


act tctgcct 


agaaaacagc 


ttcccactgc 


1860 


tcttacatgt 


gatggcatac 


cttacactaa 


aagaatatta 


ttgggggaaa 


aactacaagt 


1920 


gctgtacata 


tgctgagaaa 


ctgcagagca 


taatagctgc 


cacccaaaaa 


tcttcttgaa 


1980 
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5 



aatcatttcc 


agacaacctc 


ttactttctg 


tgtagttttt 


aattgttaaa 


aaaaaaaagt 


2040 


tttaaacaga 


agcacatgac 


atatgaaagc 


ctgcaggact 


ggtcgttttt 


ttggcaattc 


2100 


ttccacgtgg 


gacttgtcca 


caagaatgaa 


agtagtggtt 


tttaaagagt 


taagttacat 


2160 


atttattttc 


tcacttaagt 


tatttatgca 


aaagtttttc 


ttgtagagaa 


tgacaatgtt 


2220 


aatattgctt 


tatgaattaa 


cagtctgttc 


ttccagagtc 


cagagacatt 


gttaataaag 


2280 


acaatgaatc 


atgaccgaaa 


g 








2301 



<210> 4 

<211> 23 

<2 12 > PRT 

<213 > Homo sapien 

<4 00> 4 

Met Gin Leu Pro Leu Ala Leu Cys Leu Val Cys Leu Leu Val His Thr 



1 


5 




10 




15 




Ala Phe Arg 


Val Val Glu Gly 
20 










<210> 


5 












<21 1> 


2301 












<212> 


DNA 












<213> 


Homo sapien 










<4 00> 


5 












agagcctgtg 


ctactggaag 


gtggcgtgcc 


ctcctctggc 


tggtaccatg 


cagctcccac 


60 


tggccctgtg 


tctcatctgc 


ctgctggtac 


acacagcctt 


ccgtgtagtg 


gagggccagg 


120 


ggtggcaggc 


gttcaagaat 


gatgccacgg 


aaatcatccg 


cgagctcgga 


gagtaccccg 


180 


agcctccacc 


ggagctggag 


aacaacaaga 


ccatgaaccg 


ggcggagaac 


ggagggcggc 


240 


ctccccacca 


cccc tttgag 


accaaagacg 


tgtccgagta 


cagctgccgc 


gagctgcact 


300 


tcacccgcta 


cgtgaccgat 


gggccgtgcc 


gcagcgccaa 


gccggtcacc 


gagctggtgt 


360 


gctccggcca 


gtgcggcccg 


gcgcgcctgc 


tgcccaacgc 


catcggccgc 


ggcaagtggt 


420 


ggcgacctag 


tgggcoogao 


ttccgctgca 


tccccgaccg 


ctaccgcgcg 


cagcgcgtgc 


480 


agctgctgtg 


tcccggtggt 


gaggcgccgc 


gcgcgcgcaa 


ggtgcgcctg 


gtggcctcgt 


540 


gcaagtgcaa 


gcgcctcacc 


cgcttccaca 


accagtcgga 


gctcaaggac 


ttcgggaccg 


600 


aggccgctcg 


gccgcagaag 


ggccggaagc 


cgcggccccg 


cgcccggagc 


gccaaagcca 


660 


accaggccga 


gctggagaac 


gcctactaga 


gcccgcccgc 


gcccctcccc 


accggcgggc 


720 
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6 

gccccggccc tgaacccgcg ccccacattt ctgtcctctg cgcgtggttt gattgtttat 780 

atttcattgt aaatgcctgc aacccagggc agggggctga gaccttccag gccctgagga 840 

atcccgggcg ccggcaaggc ccccctcagc ccgccagctg aggggtccca cggggcaggg 900 

gagggaattg agagtcacag acactgagcc acgcagcccc gcctctgggg ccgcctacct 960 

5 ttgctggtcc cacttcagag gaggcagaaa tggaagcatt ttcaccgccc tggggtttta 1020 

agggagcggt gtgggagtgg gaaagtccag ggactggtta agaaagttgg ataagattcc 1080 

ccctcgcacc tcgctgccca tcagaaagcc tgaggcgtgc ccagagcaca agactggggg 1140 

caactgtaga tgtggtttct agtcctggct ctgccactaa cttgctgtgt aaccttgaac 1200 

tacacaattc tccttcggga cctcaatttc cactttgtaa aatgagggtg gaggtgggaa 1260 

10 taggatctcg aggagactat tggcatatga ttccaaggac tccagtgcct tttgaatggg 1320 

cagaggtgag agagagagag agaaagagag agaatgaatg cagttgcatt gattcagtgc 1380 

caaggtcact tccagaattc agagttgtga tgctctcttc tgacagccaa agatgaaaaa 1440 

caaacagaaa aaaaaaagta aagagtctat ttatggctga catatttacg gctgacaaac . 1500 

tcctggaaga agctatgctg cttcccagcc tggcttc ccc ggatgtttgg ctacctccac 1560 

15 cccrccatct caaagaaata acatcatcca ttggggtaga aaaggagagg gtccgagggt 1620 

99t999 a 999 atagaaatca catccgcccc aacttcccaa agagcagcat ccctcccccg 1680 

acccatagcc atgttttaaa gtcaccttcc gaagagaagc gaaaggttca aggacactcg 1740 

ccttgcaggc ccgagggagc agccatcaca aactcacaga ccagcacatc ccttttgaga 1800 

caccgccttc tgcccaccac tcacggacac atttctgcct agaaaacagc ttcttactgc i860 

20 tcttacatgt gatggcatat cttacactaa aagaatatta ttgggggaaa aactacaact 1920 

gctgtacata tgctgagaaa ctgcagagca taatagctgc cacccaaaaa tctttttgaa 1980 

aatcatttcc agacaacctc ttactttctg tgtagttttt aattgttaaa aaaaaaaagt 2040 

tttaaacaga agcacatgac atatgaaagc ctgcaggact ggtcgttttt ttggcaattc 2100 

ttccacgtgg gacttgtcca caagaatgaa agtagtggtt tttaaagagt taagttacat 2160 

25 atttattttc tcacttaagt tatttatgca aaagtttttc ttgtagagaa tgacaatgtt 2220 

aatattgctt tatgaattaa cagtctgttc ttccagagtc cagagacatt gttaataaag 2200 

acaatgaatc atgaccgaaa g 2301 

< 210 > 6 

30 <2ll> 213- 

<212> PR7 
<213> Homo sapien 



<400> 6 

35 Met Gin Leu Pro Leu Ala Leu Cys Leu He Cys Leu Leu Val His Thr 



l 



5 



10 



15 




. WO 00/32773 
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7 



Ala 


Phe 


Arg 


Val 


Val 


Glu Gly 


Gin 


Gly 


Trp 


Gin 


Ala 


Phe 


Lys 


Asn 


Asp 








20 










25 










30 






Ala 


Thr 


Glu 


lie 


lie 


Arg 


Glu 


Leu 


Gly 


Glu 


Tyr 


Pro 


Glu 


Pro 


Pro 


Pro 






35 










40 










45 








Glu 


Leu 


Glu 


Asn 


Asn 


Lys 


Thr 


Met 


Asn 


Arg 


Ala 


Glu 


Asn 


Gly 


Gly 


Arg 




50 










55 










60 










Pro 


Pro 


His 


His 


Pro 


Phe 


Glu 


Thr 


Lys 


Asp 


val 


Ser 


Glu 


Tyr 


Ser 


Cys 


65 










70 










75 










30 


Arg 


Glu 


Leu 


His 


Phe 


Thr 


Arg 


Tyr 


Val 


Thr 


Asp Gly 


Pro 


Cys 


Arg 


Ser 










85 










90 










95 




Ala 


Lys 


Pro 


Val 


Thr 


Glu 


Leu 


Val 


Cys 


Ser 


Gly Gin 


Cys 


Gly 


Pro 


Ala 








100 










105 










110 






Arg 


Leu 


Leu 


Pro 


Asn 


Ala 


He 


Gly 


Arg 


Gly 


Lys 


Trp 


Trp 


Arg 


Pro 


Ser 






115 










120 










125 








Gly 


Pro 


Asp 


Phe 


Arg 


Cys 


lie 


Pro 


Asp 


Arg 


Tyr 


Arg 


Ala 


Gin 


Arg 


Val 




130 










135 










140 










Gin 


Leu 


Leu 


Cys 


Pro 


Gly Gly 


Glu 


Ala 


Pro 


Arg 


Ala 


Arg 


Lys 


Val 


Arg 


145 










150 










155 










16 0 


Leu 


Val 


Ala 


Ser 


Cys 


Lys 


Cys 


Lys 


Arg 


Leu 


Thr Arg 


Phe 


His 


Asn 


Gin 










165 










170 










175 




Ser 


Glu 


Leu 


Lys 


Asp 


Phe 


Gly 


Thr 


Glu 


Ala 


Ala Arg 


Pro 


Gin 


Lys 


Gly 








180 










185 










190 






Arg 


Lys 


Pro 


Arg 


Pro 


Arg 


Ala 


Arg 


Ser 


Ala 


Lys 


Ala 


Asn 


Gin 


Ala 


Glu 



195 200 205 



25 Leu Glu Asn Ala Tyr 
210 

<2 10> 7 
<211> 2301 
30 <212> DNA 

<2 l 3> Homo sapien 

<4 00 > 7 

agagcctgtg ctactggaag gtggcgtgcc ctcctctggc tggtaccatg cagctcccac 
35 cggccctgtg tctcgtccgc ctgctggtec acacagcctc ccgtgtagtg gagggccagg 
ggtggcaggc gttcaagaat gatgccacgg aaatcatccc cgagctcgga gagtaccccg 



60 

120 



180 
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agcctccacc ggagctggag aacaacaaga ccatyaaccg ggcggagaac ggagggcggc 240 

ctccccacca cccctttgag accaaagacg tgtccgagta cagctgccgc gagctgcact 300 

tcacccgcta cgtgaccgat gggccgtgcc gcagcgccaa gccggtcacc gagctggtgt 360 

gctccggcca gtgcggcccg gcgcgcctgc tgcccaacgc catcggccgc ggcaagtggt 420 

5 ggcgacctag tgggcccgac ttccgctgca tccccgaccg ctaccgcgcg cagcgcgtgc 480 

agctgctgtg tcccggtggt gaggcgccgc gcgcgcgcaa ggtgcgcctg gtggccccgt 540 

gcaagtgcaa gcgcctcacc cgcttccaca accagtcgga gcticaagyac ttcgggaccg 600 

aggccgctcg gccgcagaag ggccggaagc cgcggccccg cgcccggagc gccaaagcca 660 

accaggccga gctggagaac gcctactaga gcccgcccgc gcccctcccc accggcgggc 720 

10 gccccggccc cgaacccgcg ccccacattt ctgtcctctg cgcgtggttt gattgttcat 780 

atttcattgt aaatgcctgc aacccagggc agggggctga gaccttccag gccctgagga 840 

atcccgggcg ccggcaaggc cccccccagc ccgccagctg aggggtccca cggggcaggg 900 

gagggaattg agagtcacag acactgagcc acgcagcccc gcctctgggg ccgcctacct 960 

tcgctggtcc cacttcagag gaggcagaaa tggaagcatt ttcaccgccc tggggttcta 1020 

15 agggagcggt gtgggagtgg gaaagtccag ggactggtta agaaagttgg ataagatucc 1080 

cccttgcacc tcgctgccca tcagaaagcc tgaggcgtgc ccagagcaca agactggggg 1140 

caactgtaga tgtggtttct agtcctggct ctgccactaa cttgctgtgt aaccttgaac 1200 

tacacaattc tccttcggga cctcaatttc cactttgtaa aatgagggtg gaggtgggaa 1260 

taggatctcg aggagactat tggcatatga ttccaaggac tccagtgcct tttgaatggg 1320 

20 cagaggtgag agagagagag agaaagagag agaatgaatg cagttgcatt gatucagagc 1380 

caaggtcact tccagaattc agagttgtga tgctctcttc tgacagccaa agatgaaaaa 1440 

caaacagaaa aaaaaaagta aagagtctat ttatggctga catatttacg gctgacaaac 1500 

tcctggaaga agctatgctg cttcccagcc tggcttcccc ggatgtttgg ctacctccac 1560 

ccctccatct caaagaaata acatcatcca ttggggtaga aaaggagagg gtccgagggt 1620 

25 ggtgggaggg atagaaatca catccgcccc aacttcccaa agagcagcat ccctcccccg 1680 

acccatag cc atgttttaaa gtcaccttcc gaagagaagt gaaaggttca aggacacugg 1740 

cctcgcaggc ccgagggagc agccatcaca aactcacaga ccagcacatc ccttttgaga 1800 

caccgccttc tgcccaccac tcacggacac atttctgcct agaaaacagc ttcttacugc 1860 

tcttacatgt gatggcatat cttacactaa aagaatatta ttgggggaaa aactacaagt 1920 

30 gctgtacata tgctgagaaa ctgcagagca taatagctgc cacccaaaaa tctttttgaa 1980 

aatcatttcc agacaacccc ttactctctg tgtagttttt aattgttaaa aaaaaaaagc 2040 

tttaaacaga agcacatgac atatgaaagc ctgcaggact ggtcgttttt ttggcaattc 2100 

ttccacgtgg gacttgtcca caagaatgaa agtagtggtt tttaaagagt taagttacat 2160 

atttattttc tcacttaagt tatttatgca aaagtttttc ttgtagagaa tigacaatgtt 2220 

35 aatattgctt catgaattaa cagcctgctc ctccagagtc cagagacatt gttaataaag 2280 

acaatgaatc atgaccgaaa g 



2301 
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< 210 > 8 
<2 11 > 213 
<2 12 > PRT 

5 <213> Homo sapien 

< 4 00 > 8 

Met Gin Leu Pro Leu Ala Leu Cys Leu Val Cys Leu Leu Val His Thr 

15 10 15 

10 Ala Phe Arg Val Val Glu Gly Gin Gly Trp Gin Ala Phe Lys Asn Asp 

20 25 30 

Ala Thr Glu lie He Arg Glu Leu Gly Glu Tyr Pro Glu Pro Pro Pro 

35 40 45 

Glu Leu Glu Asn Asn Lys Thr Met Asn Arg Ala Giu Asn Gly Gly Arg 

15 50 55 ' 60 

Pro Pro His His Pro Phe Glu Thr Lys Asp Val Ser Glu Tyr Ser Cys 

65 70 75 80 

Arg Glu Leu His Phe Thr Arg Tyr Val Thr Asp Gly Pro Cys Arg Ser 

85 90 95 

20 Ala Lys Pro Val Thr Glu Leu Val Cys Ser Gly Gin Cys Gly Pro Ala 

100 105 110 

Arg Leu Leu Pro Asn Ala lie Gly Arg Gly Lys Trp Trp Arg Pro Ser 

115 120 125 

Gly Pro Asp Phe Arg Cys Il.e Pro Asp Arg Tyr Arg Ala Gin Arg Val 

25 130 135 140 

Gin Leu Leu Cys Pro Gly Gly Glu Ala Pro Arg Ala Arg Lys Val Arg 

145 150 155 160 

Leu Val Ala Ser Cys Lys Cys Lys Arg Leu Thr Arg Phe His Asn Gin 

165 170 175 

30 Ser Glu Leu Lys Asp Phe Gly Thr Glu Ala Ala Arg Pro Gin Lys Gly 

180 185 190 

Arg Lys Pro Arg Pro Arg Ala -Arg Ser Ala Lys Ala Asn Gin Ala Glu 

195 200 205 

Leu Glu Asn Ala Tyr 
35 210 




WO 00/32773 



PCT/US99/27990 



10 

< 21 0> 9 
<2 1 1 > 642 
<212 > DNA 

<213> Cercopithecus pygerythrus 
5 

<4 00> 9 



atgcagctcc cactggccct gtgtcttgtc: tgcctgctgg tacacgcagc cttccgtgta 60 

gtggagggcc aggggtggca ggcctccaag aatgatgcca cggaaatcat ccccgagctc 120 

ggagagtacc ccgagcctcc accggagctg gagaacaaca agaccatgaa ccgggcggag 180 

10 aacggagggc ggcctcccca ccaccccttt gagaccaaag acgtgtccga gtacagctgc 240 

cgagagctgc acctcacccg ctacgtgacc gatgggccgt gccgcagcgc caagccagtc 300 

accgagttgg tgcgctccgg ccagtgcggc ccggcacgcc tgctgcccaa cgccatcggc 360 

cgcggcaagt ggcggcgccc gagtgggccc gacttccgct gcatccccga ccgctaccgc 420 

gcgcagcgtg tgcagctgct gtgccccggt ggtgccgcgc cgcgcgcgcg caaggtgcgc 480 

15 ctggtggcct cgtgcaagtg caagcgcctc acccgcttcc acaaccagtc ggagctcaag 540 

gacttcggtc ccgaggccgc tcggccgcag aagggccgga agccgcggcc ccgcgcccgg 600 
ggggccaaag ccaatcaggc cgagctggag aacgcctact ag 642 



< 210 > 10 

20 <21 1> 213 

<212> PRT 

<213> Cercopithecus pygerythrus 
<4 00> 10 

25 Met Gin Leu Pro Leu Ala Leu Cys Leu Val Cys Leu Leu Val His Ala 
15 10 15 

Ala Phe Arg Val Val Glu Gly Gin Gly Trp Gin Ala Phe Lys Asn Asp 
20 25 30 

Ala Thr Glu lie lie Pro Glu Leu Gly Glu Tyr Pro Glu Pro Pro Pro 

30' 35 40 45 

Glu Leu Glu Asn Asn Lys Thr Met Asn Arg Ala Glu Asn Gly Gly Arg 

50 55-60 

Pro Pro His His Pro Phe Glu Thr Lys Asp Val Ser Glu Tyr Ser Cys 

65 70 75 80 

35 : Arg Glu Leu His Phe Thr Arg Tyr Val Thr * Asp Gly Pro Cys Arg Ser 



85 



90 



95 
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Ala 


Lys 


Pro 


Val 


Thr 


Glu 


Leu 


Val Cys Ser 


Gly 


Gin 


Cys 


Gly 


Pro 


Ala 








100 








105 








110 






Arg 


Leu 


Leu 


Pro 


Asn 


Ala 


He 


Gly Arg Gly Lys 


Trp 


Trp 


Arg 


Pro 


Ser 






115 










120 






125 








Gly 


Pro 


Asp 


Phe 


Arg 


Cys 


lie 


Pro Asp Arg 


Tyr 


Arg 


Ala 


Gin 


Arg 


Val 




130 










135 






140 










Gin 


Leu 


Leu 


Cys 


Pro 


Gly Gly 


Ala Ala Pro 


Arg 


Ala 


Arg 


Lys 


Val 


Arg 


145 










150 






155 










16 0 


Leu 


Val 


Ala 


Ser 


Cys 


Lys 


Cys 


Lys Arg Leu 


Thr 


Arg 


Phe 


His 


Asn 


Gin 










165 






170 










175 




Ser 


Glu 


Leu 


Lys 


Asp 


Phe 


Gly 


Pro Glu Ala 


Ala 


Arg 


Pro 


Gin 


Lys 


Gly 








180 








185 








190 






Arg 


Lys 


Pro 


Arg 


Pro 


Arg 


Ala 


Arg Gly Ala 


Lys 


Ala 


Asn 


Gin 


Ala 


Glu 






195 










200 






205 








Leu 


Glu 


Asn 


Ala 


Tyr 





















210 



20 



< 210 > 11 

<2ll> 638 

<2 12> DNA 

<213> Mus musculus 



25 



30 



35 



<400 

atgcagccct 

gtggagggcc 

ggagagtacc 

ggcagacctc 

ctgcactaca 

ttggtgtgct 

aagtggtggc 

cgggtgcagc 

gcctcgtgca 

gggccggaga 

aaagccaacc 



> 11 

cactagcccc 

aggggtggca 

ccgagcctcc 

cccaccatcc 

cccgcttcct 

ccggccagtg 

gcccgaacgg 

tgctgtgccc 

agtgcaagcg 

ccgcgcggcc 

aggcggagct 



gtgcctcatc 

agccttcagg 

tcctgagaac 

ctatgacgcc 

gacagacggc 

cggccccgcg 

accggattcc 

cgggggcgcg 

cct'cacccgc 

gcagaagggt 

ggagaacgcc 



tgcctactcg 

aatgatgcca 

aaccagacca 

aaaggtgtgt 

ccatgccgca 

cggctgctgc 

cgctgcatcc 

gcgccgcgct 

ctccacaacc 

cgcaagccgc 

tactagag 



tgcacgctgc 

cagaggtcat 

tgaaccgggc 

ccgagtacag 

gcgccaagcc 

ccaacgccat 

cggatcgcta 

cgcgcaaggt 

agtcggagct 

ggcccggcgc 



cttctgtgct 

cccagggctt 

ggagaatgga 

ctgccgcgag 

ggtcaccgag 

c 999cgcgtg 

ccgcgcgcag 

gcgtctggtg 

caaggacttc 

ccggggagcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

638 



< 21 0 > 12 
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12 



5 



10 



15 



20 



30 



< 21 1 > 211 
<212> PRT 
<2 13> Mus musculus 



<4 00> 12 



Met 


Gin 


Pro 


Ser 


Leu 


Ala 


Pro 


Cys 


Leu 


lie 


Cys 


Leu 


Leu 


Val 


His 


Ala 


l 








5 










10 










15 




Ala 


Phe 


Cys 


Ala 


Val 


Glu 


Gly Gin Gly 


Trp 


Cln 


Ala 


Phe 


Arg 


Asn 


Asp 








20 










25 










30 






Ala 


Thr 


Glu 


val 


lie 


Pro 


Gly Leu 


Gly 


Glu 


Tyr 


Pro 


Glu 


Pro 


Pro 


Pro 






35 










40 










45 








Glu 


Asn 


Asn 


Gin 


Thr 


Met 


Asn 


Arg 


Ala 


Glu 


Asn 


Gly 


Gly Arg 


Pro 


Pro 




50 










55 










60 










His 


His 


Pro 


Tyr 


Asp 


Ala 


Lys 


Asp 


Val 


Ser 


Glu 


Tyr 


Ser 


Cys 


Arg 


Glu 


65 










70 










75 










80 


Leu 


His 


Tyr 


Thr 


Arg 


Phe 


Leu 


Thr 


Asp 


Gly 


Pro 


Cys 


Arg 


Ser 


Ala 


Lys 










85 










90 










95 




Pro 


val 


Thr 


Glu 


Leu 


val 


Cys 


Ser 


Gly 


Gin 


Cys 


Gly 


Pro 


Ala 


Arg 


Leu 








100 










105 










110 






Leu 


Pro 


Asn 


Ala 


He 


Gly 


Arg 


Val 


Lys 


Trp 


Trp Arg 


Pro 


Asn Gly 


Pro 






115 










120 










125 








Asp 


Phe 


Arg 


Cys 


■lie 


Pro 


Asp 


Arg. Tyr 


Arg 


Ala 


Gin 


Arg 


val 


Gin 


Leu 




130 










135 










140 










Leu 


Cys 


Pro 


Gly Gly 


Ala 


Ala 


Pro 


Arg 


Ser 


Arg 


Lys 


Val 


Arg 


Leu 


Val 


145 










150 










155 




- 






160 


Ala 


Ser 


Cys 


Lys 


Cys 


Lys 


Arg 


Leu 


Thr 


Arg 


Phe 


His 


Asn 


Gin 


Ser 


Glu 










165 










170 










175 




Leu 


Lys 


Asp 


Phe 


Gly 


Pro 


Glu 


Thr 


Ala 


Arg 


Pro 


Gin 


Lys 


Gly Arg 


Lys 








180 










185 










190 






Pro 


Arg 


Pro 


Gly Ala 


Arg 


Gly Ala 


Lys 


Ala 


Asn 


Gin 


Ala 


Glu 


Leu 


Glu 



195 2C0 205 



Asn Ala Tyr 
210 



35 



<210> 13 



<211 > 674 
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<212=> DNA 

<213> Rattus norvegicus 
<4 00> 13 

5 gaggaccgag tgcccttcct ccttctggca 

cccgcctgct tgtacatgca gccttcgttg 
agaatgatgc cacagaaatc atcccgggac 
tagagaacaa ccagaccatg aaccgggccg 
atgacaccaa agacgtgtcc gagtacagct 
10 ccgacggccc gtgccgcagt gccaagccgg 

gccccgcgcg gctgctgccc aacgccatcg 
ccgacttccg ctgcatcccg gatcgctacc 
gcggcgcggc gccgcgctcg cgcaaggtgc 
tcacccgctt ccacaaccag tcggagctca 
15 agaagggtcg caagccgcgg ccccgcgccc 

agaacgccta ctag 

<210> 14 
<2ll> 213 
20 <212 > PRT 

<213> Rattus norvegicus 

<400> 14 

Met Gin Leu Ser Leu Ala Pro Cys 
25 1 5 

Ala Phe Val Ala Val Glu Ser Gin 
20 

Ala Thr Glu lie lie Pro Gly Leu 
35 40 

30 Glu Leu Glu Asn Asn Gin Thr Met 
50 55 

Pro Pro His His Pro Tyr Asp Thr 
65 70 

Arg Glu Leu His Tyr Thr Arg Phe 
85 

Ala Lys Pro Val Thr Glu Leu Val 



13 



ccatgcagct 


ctcactagcc 


ccttgccttg 


60 


ctgtggagag 


ccaggggtgg 


caagccttca 


120 


tcagagagta 


cccagagcct 


cctcaggaac 


180 


agaacggagg 


cagacccccc 


caccatcctt 


240 


gccgcgagct 


gcactacacc 


cgcttcgtga 


300 


tcaccgagtt 


ggtgtgctcg 


ggccagtgcg 


360 


ggcgcgtgaa 


gtggtggcgc 


ccgaacggac 


420 


gcgcgcagcg 


ggtgcagctg 


ctgtgccccg 


480 


gtctggtggc 


ctcgtgcaag 


tgcaagcgcc 


540 


aggacttcgg 


acctgagacc 


gcgcggccgc 


600 


ggggagccaa 


agccaaccag 


gcggagctgg 


660 








674 



Leu Ala Cys Leu Leu Val His Ala 
10 15. 

Gly Trp Gin Ala Phe Lys Asn Asp 
25 30 

Arg Glu Tyr Pro Glu Pro Pro Gin 
45 

Asn Arg Ala Glu Asn Gly Gly Arg 
60 

Lys Asp Val Ser Glu Tyr Ser Cys 
75 80 

Val Thr Asp Gly Pro Cys Arg Ser 
90 95 

Cys Ser Gly Gin Cys Gly Pro Ala 



35 
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100 105 110 

Arg Leu Leu Pro Asn Ala He Gly Arg Val Lys Trp Trp Arg Pro Asn 
115 120 125 

Gly Pro Asp Phe Arg Cys lie Pro Asp Arg Tyr Arg Ala Gin Arg Val 

5 130 135 140 

Gin Leu Leu Cys Pro Gly Gly Ala Ala Pro Arg Ser Arg Lys Val Arg 

145 150 155 160 

Leu Val Ala Ser Cys Lys Cys Lys Arg Leu Thr Arg Phe His Asn Gin 
165 170 175 

10 Ser Glu Leu Lys Asp Phe Gly Pro Glu Thr Ala Arg Pro Gin Lys Gly 
190 185 190 

Arg Lys Pro Arg Pro Arg Ala Arg Gly Ala Lys Ala Asn Gin Ala Glu 
195 200 205 

Leu Glu Asn Ala Tyr 
15 210 

<210> 15 
<211> 532 
<212> DNA 

20 <213 > Bos torus 



<400> 15 

agaatgatgc cacagaaatc atccccgagc tgggcgagta ccccgagcct ctgccagagc 60 

tgaacaacaa gaccatgaac cgggcggaga acggagggag acctccccac cacccctttg 120 

25 agaccaaaga cgcctccgag tacagctgcc gggagctgca cttcacccgc tacgtgaccg 180 

a t 999 c cgtg ccgcagcgcc aagccggtca ccgagctggt gtgctcyggc cagtgcggcc 240 

cggcgcgcct gctgcccaac gccatcggcc gcggcaagtg gtggcgccca agcgggcccg 300 

acttccgctg catccccgac cgctaccgcg cgcagcgggt gcagctgttg tgtcctggcg 360 

gcgcggcgcc gcgcgcgcgc aaggtgcgce tggtggcctc gtgcaagtgc aagcgcctca 420 

30 ctcgcttcca caaccagtcc gagctcaagg acttcgggcc cgaggccgcg cggccgcaaa 480 

cgggccgaaa gctgcggccc cgcgcccggg gcaccaaagc cagccgggcc ga 532. 



<210> 16 
<211 > 176 
35 <212> PRT 

<213> Bos torus 
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15 



5 



10 



15 



20 



30 



35 



<4 0 0> 16 



Asn 


Asp 


Ala 


Thr 


Glu 


lie 


He 


Pro 


Glu 


Leu 


Gly Glu 


Tyr 


Pro 


Glu 


Pro 


1 








5 










10 








15 




Leu 


Pro 


Glu 


Leu 


Asn 


Asn 


Lys 


Thr 


Met 


Asn Arg Ala 


Glu 


Asn 


Gly 


Gly 








20 










25 








30 






Arg 


Pro 


Pro 


His 


His 


Pro 


Phe 


Glu 


Thr 


Lys 


Asp Ala 


Ser 


Glu 


Tyr 


Ser 






35 










40 








45 








Cys 


Arg 


Glu 


Leu 


His 


Phe 


Thr 


Arg 


Tyr 


Val 


Thr Asp 


Gly 


Pro 


Cys 


Arg 




50 










55 








60 










Ser 


Ala 


Lys 


Pro 


Val 


Thr 


Glu 


Leu 


Val 


Cys 


Ser Gly 


Gin 


Cys 


Gly 


Pro 


65 










70 










75 








80 


Ala 


Arg 


Leu 


Leu 


Pro 


Asn 


Ala 


lie 


Gly Arg Gly Lys 


Trp 


Trp 


Arg 


Pro 










85 










90 








95 




Ser 


Gly 


Pro 


Asp 


Phe 


Arg 


Cys 


lie 


Pro 


Asp Arg Tyr 


Arg 


Ala 


Gin 


Arg 








100 










105 








110 






val 


Gin 


Leu 


Leu 


Cys 


Pro 


Gly 


Gly 


Ala 


Ala 


Pro Arg 


Ala 


Arg 


Lys 


Val 






115 










120 








125 








Arg 


Leu 


val 


Ala 


Ser 


Cys 


Lys 


Cys 


Lys 


Arg 


Leu Thr 


Arg 


Phe 


His 


Asn 




130 










135 








140 










Gin 


Ser 


Glu 


Leu 


Lys 


Asp 


Phe 


Gly 


Pro 


Glu 


Ala Ala 


Arg 


Pro 


Gin 


Thr 


145 










150 










155 








160 


Gly 


Arg 


Lys 


Leu 


Arg 


Pro 


Arg 


Ala 


Arg Gly Thr Lys 


Ala 


Ser 


Arg 


Ala 



165 170 175 
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cgcgttttgg tgagcagcaa tattgcgctt cgacgagcct tggcg^tgag attgatacct 60 

ctgccgcaca aaaggcaatc gaccgagctg gaccagcgca ttcgtgacac cgcctcctnc 120 

gaacttatcc gcaatggagt gtcactcatc aaggacngcc tgatcgcaaa tggtgctarc 180 

cacgcagcgg caatcgaaaa ccctcagccg gtgaccaata tctacaacat cagccttggt 240 

5 atcctgcgcg acgagccagc gcagaacaag gtaaccgtca gtgccgacaa gtccaaagnt 300 

aaacctggtg ttgataccaa cattgaaacg ttgatcgaaa acgcgctgaa aaacgctgct 360 

gaatgtgcgg cgctggatgt cacaaagcaa acggcagcag acaagaaagc gatggatgaa 420 

ctggcttcct atgtccgcac ggccatcatg atggaatgtt tccccggtgg tgttatctgg 480 

cagcagtgcc gccgatagta tgcaattgat aattattatc atttgcgggt cctttccggc 540 

10 gatccgcctt gttacggggc ggcgacctcg cgggttttcg ctatttatga aaattttccg 600 

gtttaaggcg tttccgttct tcttcgtcat aacttaatgt ttttatttaa aataccctct 660 

gaaaagaaag gaaacgacag gtgctgaaag cgagcttttt ggcctctgtc gtttcctttc 720 

tctgtttttig tccgtggaat gaacaatgga agtcaacaaa aagcagagct tatcgatgat 780 

aagcggtcaa acatgagaat tcgcggccgc ataatacgac tcactatagg gatcgacgcc 840 

15 tactccccgc gcatgaagcg gaggagctcg actccgcatg ccc agagacg ccccccaacc 900 

cccaaagtgc ctgacctcag cctctaccag ctctggcttg ggcttgggcg gggtcaaggc 960 

taccacgttc tcttaacagg tggctgggct gtctcttggc cgcgcgtcat gcgacagctg 1020 

cctagttctg cagtgaggtc accgtggaat gtctgccttc gttgccatgg caacgggatg 1080 

acgttacaat ctgggtgtgg agcttttcct gtccgtgtca ggaaatccaa ataccctaaa 1140 

20 ataccctaga agaggaagta gctgagccaa ggctttcctg gcttctccag ataaagtttg 1200 

acttagatgg aaaaaaacaa aatgataaag acccgagcca tctgaaaatt cctcctaatt 1260 

gcaccactag gaaatgtgta tattattgag ctcgtatgtg ttcttatttt aaaaagaaaa 1320 

ctttagtcat gttattaata agaatttctc agcagtggga gagaaccaat attaacacca 1380 

agataaaagt tggcatgatc cacattgcag gaagatccac gttgggtttt catgaatgtg 1440 

25 aagaccccat ttattaaagt cctaagctct gtttttgcac actaggaagc gatggccggg 1500 

atggctgagg ggctgtaagg atctttcaat gtcctacatg tgtgtttcct gtcctgcacc 1560 

taggacctgc tgcctagcct gcagcagagc cagaggggtt tcacatgatt agtctcagac 1620 

acttgggggc aggttgcatg tactgcatcg cttatttcca tacggagcac ctactatgtg 1680 

tcaaacacca tatggtgttc actcttcaga acggtggtgg tcatcatggt gcatttgctg 1740 

30 acggttggat. tcgtggtaga gagctgagat atatggacgc actcttcagc attctgtcaa 1800 

cgtggctgtg cattcttgct cctgagcaag tggctaaaca gactcacagg gtcagcctcc 1860 

agctcagtcg ctgcatagtc ttagggaacc tctcccagtc ’ ctccctacct caactatcca 1920 
agaagccagg gggcttggcg gtctcaggag cctgcttgct gggggacagg ttgttgagtt 1980 

ttatctgcag taggttgccc aggcatagtg tcaggactga tggctgcctt ggagaacaca 2040 

35 tcctttgccc tctatgcaaa tctgaccttg acatgggggc g'ctgctcagc tgggaggatc 2100 

aactgcatac ctaaagccaa gcctaaagct tcttcgtcca cctgaaactc ctggaccaag 



2160 
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gggcttccgg cacatcctct caggccagtg agggagtctg tgtgagctgc actttccaat 2220 

ctcagggcgt- gagaggcaga gggaggtggg ggcagagcct tgcagctctt tcctcccatc 2280 

tggacagcgc tctggctcag cagcccatat gagcacaggc acatccccac cccaccccca 2340 

cctttcctgt cctgcagaat ttaggctctg ttcacggggg gggggggggg ggggcagtcc 2400 

5 tatcctctct taggtagaca ggactctgca ggagacactg ctttgtaaga tactgcagtt 2460 

taaatttgga tgttgtgagg ggaaagcgaa gggcctcttt gaccattcag tcaaggtacc 2520 

ttctaactcc catcgtattg ggyggctact ctagtgctag acattgcaga gagcctcaga 2580 

actgtagtta ccagtgtggt aggattgatc cttcagggag cctgacatgt gacagttcca 2640 

ttcttcaccc agtcaccgaa catttattca gtacctaccc cgtaacaggc accgtagcag 2700 

10 gtactgaggg acggaccact caaagaactg acagaccgaa gccttggaat ataaacacca 2760 

aagcatcagg ctctgccaac agaacactct ttaacactca ggccctttaa cacccaggac 2820 

ccccaccccc accccaagca gttggcactg ctatccacat tttacagaga ggaaaaacta 2880 

ggcacaggac gatataagtg gcttgcttaa gcttgtctgc atggtaaatg gcagggctgg 2940 

attgagaccc agacattcca actctagggt ctatttttct tttttctcgt tgttcgaatc 3000 

15 tgggtcttac tgggtaaact caggctagcc tcacactcat atccttctcc catggcttac 3060 

gagtgctagg attccaggtg tgtgctacca tgtctgactc cctgtagctt gtctatacca 3120 

tcctcacaac ataggaattg tgatagcagc acacacaccg gaaggagctg gggaaatccc 3180 

acagagggct ccgcaggatg acaggcgaat gcctacacag aaggtgggga agggaagcag 3240 

agggaacagc atgggcgtgg gaccacaagt ctatttgggg aagctgccgg taaccgtat a 3300’ 

20 tggctggggt gaggggagag gtcatgagat gaggcaggaa gagccacagc aggcagcggg 3360 

tacgggctcc ttattgccaa gaggctcgga tcttcctcct cttcctcctt ccggggctgc 3420 

ctgttcattt tccaccactg cctcccatcc aggtctgtgg ctcaggacat cacccagctg 3480 

cagaaactgg gcatcaccca cgtcctgaat gctgccgagg gcacgtcctt catgcacgtc 3540 

aacaccagtg ctagcttcta cgaggattct ggcatcacct acttgggcat caaggccaat 3600 

25 gatacgcagg agttcaacct cagtgcttac tttgaaaggg ccacagattt cattgaccag 3660 

gcgctggccc ataaaaatgg taaggaacgt acattccggc acccatggag cgtaagccct 3720 

ctgggacctg cttcctccaa agaggccccc acttgaaaaa ggttccagaa agatcccaaa 3780 

atatgccacc aactagggat taagtgtcct acatgtgagc cgatgggggc cactgcatat 3840 

agtctgtgcc atagacatga caatggataa taatatttca gacagagagc aggagttagg 3900 

30 tagctgtgct cctttccctt taattgagtg tgcccatttt tttattcatg tatgtgtata 3960 

catgtgtgtg cacacatgcc ataggttgat actgaaca c'c gtcttcaatc gttccccacc 4020 

ccaccttatt ttttgaggca gggtctcttc cctgatcctg gggctcattg gtttatctag 4080 

gctgctggcc agtgagctcc ggagttctgc ttttctctac ctccctagcc ctgggactgc 4140 

aggggcatgt gctgggccag gcttr.tar.gt cgcgttgggg atctgaactt aggtccctag 4200 

35 gcctgagcac cgtaaagact ctgccacatc cccagcctgt ttgagcaagt gaaccattcc 4260 

ccaaaattcc cccagtgggg ctttcctacc cttttattgg ctaggcattc atgagtggtc 



4320 
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acctcgccag 


aggaatgagt 


ggccacgact 


ggctcagggt 


cagcagccta 


gagatactgg 


4380 




gttaagtctt 


ccugccgctc 


gctccctgca 


gccgcagaca 


gaaagtagga 


ctgaatgaga 


4440 




gctggctagt 


ggccagacag 


gacagaaggc 


tgagagggtc 


acagggcaga 


tgtcagcaga 


4500 




gcagacaggt 


tccccctctg 


tgggggaggg 


gtggcccact 


gcaggtgtaa 


ttggccttct 


4560 


5 


ttgtgctcca 


tagaggcttc 


ctgggtacac 


agcagcttcc 


ctgtcctggt 


gattcccaaa 


4620 




gagaactccc 


taccaccgga 


cttacagaag 


ttctattgac 


tcgtgtaacg 


gttcaacagc 


4680 




tttggctctt 


gg^ggacggt 


gcatactgct 


gtatcagctc 


aagagctcat 


tcacgaatga 


4740 




acacacacac 


acacacacac 


acacacacac 


acacaagcta 


attttgatat 


gccttaacta 


4800 




gctcagtgac 


tgggcatttc 


tgaacatccc 


tgaagttagc 


acacatttcc 


ctctggtgtt 


4860 


10 


cctggcttaa 


caccctctaa 


atcfcatattt 


tatctttgct 


gccctgttac 


cttctgagaa 


4920 




gcccctaggg 


ccacttccct 


tcgcacctac 


attgctggat 


ggtttctctc 


ctgcagctct 


4980 




taaatctgat 


ccctictgcct 


ctgagccatg 


ggaacagccc 


aataactgag 


ttagacataa 


5040 




aaacgtctct 


agccaaaact 


tcagctaaat 


ttagacaata 


aatcttactg 


gttgtggaat 


5100 




cctcaagatt 


cttcatgacc 


tccttcacat 


ggcacgagta 


tgaagcttta 


ttacaattgt 


5160 


15 


ttattgatca 


aactaactca 


taaaaagcca 


gttgtctttc 


acctgctcaa 


ggaaggaaca 


5220 




aaactcatcc 


ttaactgatc 


tgtgcacctt 


gcacaatcca 


tacgaatatzc 


ttaagagtac 


5280 




taagattttg 


gttgtgagag 


tcacatgtta 


cagaatgtac 


agctttgaca 


aggtgcatcc 


5340 




ttgggatgcc 


gaagtgacct 


gctgttccag 


ccccctacct 


tctgaggctg 


ttttggaagc 


5400 




aatgctctgg 


aagcaacttt 


aggaggtagg 


atgctggaac 


agcgggtcac 


ttcagcatc c 


5460 


20 


cgacgacgaa 


tcccgtcaaa 


gctgtacatt 


ctgtaacaga 


ctgggaaagc 


tgcagacttt 


5520 




aaggccaggg 


ccctatggtc 


cctcttaatc 


cctgtcacac 


ccaacccgag 


cccttctcct 


5580 




ccagccgttc 


tgtgcttctc 


actctggata 


gatggagaac 


acggccttgc 


tagttaaagg 


5640 




agtgaggctt 


cacccttictc 


acatggcagt 


ggttggtcat 


cctcattcag 


ggaactctgg 


5700 




ggcattctgc 


ctttacttcc 


tctttttgga 


ctacagggaa 


tatatgctga 


cttgttttga 


5760 


25 


ccttgtgtat 


ggggagactg 


gatctttggc 


ctggaacgtt 


tcctgctagt 


ttttccccat 


5820 




cctttggcaa 


accctatcta 


tatcttacca 


ctaggcatag 


tggccctcgt 


tctggagcct 


5880 




gccctcaggc 


tggttctcgg ggaccatgtc 


cctggtttct 


ccccagcata 


tggtgttcac 


5940 




agtgttcact 


gcgggtggtt 


gctgaacaaa 


gcggggattg 


catcccagag 


ccccggtgcc 


6000 




tngcgggtac 


actgctaaga 


taaaacggat 


actggcctct 


ctctgaccac 


ttgcagagct 


6060 


30 


ctggtgcctt 


gtgggtacac 


tgctaagata 


aaatggatac 


tggcctctct 


ctaCccactt 


6120 




gcaggactct: 


agggaacagg 


aacccaccac 


cgagaaaacc 


aggggctagg 


agcagggagg 


6180 




tagctgggca 


gctgaagtgc 


ttggcgacta 


accaatgaat 


accagagttt 


ggatctctag 


6240 




aatactctta 


aaacctgggt 


gggcagagtg 


gcctgcccgc 


aatcccagaa 


ctcgggaggc 


6300 




ggagacaggg 


aatcatcaga 


gcaaaccggc 


taaccagaat 


agceaaacac 


tgagctctgg 


6360 


35 


gctctgtgag 


agatcctgcc 


tcaacacata 


agagagagaa 


taaaacattg 


aagaagacag 


6420 




tagatgccaa 


ttttaagccc 


ccacatgca c 


atggacaagt 


gtgcgtttga 


acacacatat 


6480 
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gcactcatgc 


gaaccaggca 


tycacactcg 


ggcttatcac 


acacataatt 


tgaaagagag 


6 54 0 




agtgagagag 


gagagtgcac 


attagagttc 


acaggaaagt 


gtgagtgagc 


acacccatgc 


6600 




acacagacat 


gtgtgccagg 


gagr.aggaaa 


ggagcctggg 


tttgtgtata 


agagggagcc 


6660 




atcatgtgtt 


tctaaggagg 


gcgtgtgaag 


gaggcgttgt 


gtgggctggg 


actggagcat 


6720 


5 


ggttgtaact 


gagcatgctc 


cctgtgggaa 


acaggagggt 


ggccaccctg 


cagagggtcc 


6780 




cactgtccag 


cgggatcagt 


aaaagcccct 


gctgagaact 


ttaggtaata 


gccagagaga 


6840 




gaaaggtagg 


aaagtggggg 


gactcccatc 


tctgatgtag 


gaggatctgg 


gcaagtagag 


6900 




gtgcgtttga 


ggtaqaaaga 


ggggtgcaga 


ggagatgctc 


ttaattctgg 


gtcagcagtt 


6960 




tctttccaaa 


taatgcctgt 


gaggaggtgt 


aggtggtggc 


cattcactca 


ctcagcagag 


7020 


10 


ggatgatgat 


gcccggtgga 


•tgctggaaat 


ggccgagcat 


caaccctggc 


tctggaagaa 


7080 




ctccatcttt 


cagaaggaga 


gtggatctgt 


gtatggccag 


c 9gggtcaca 


ggtgcttggg 


7140 




gcccctgggg 


gactcctagc 


actgggtgat 


gtttatcgag 


tgctcttgtg 


tgccaggcac 


7200 




tggcctgggg 


ctttgtttct 


gtctctgttt 


tgtttcgttt 


tttgagacag 


actcttgcta 


72 6 0 




tgtatccgtg 


tcaatcttgg 


aatctcactg 


catagcccag 


gctgcggaga 


gaggggaggg 


7320 


15 


caataggcct 


tgtaagcaag 


ccacacttca 


gagactagac 


tccaccctgc 


gaatgatgac 


7380 




aggtcagagc 


tgagttccgg 


aagattttt t 


ttccagctgc 


caggtggagt 


gtggagtggc 


7440 




agctagcggc 


aagggtagag 


ggcgagctcc 


ctgtgcagga 


gaaatgcaag 


caagagatgg 


7500 




caagccagtg 


agttaagcat 


tctgtgtggg 


gagcaggtgg 


atgaagagag 


aggctgggct 


7560 




ttcgcctctg 


gggggggggt 


gaggggtggg 


gatgaggtga 


gaggagggca 


gctccctgca 


7620 


20 


gtgtgatgag 


atttttcctg 


acagtgacct 


ttggcctctc 


cctcccccac 


ttcccttctt 


7680 




tcctttcttc 


ccaccattgc 


tttccttgtc 


cttgagaaat 


tctgagtttc 


cacttcactg 


7740 




gtgatgcaga 


cggaaacaga 


agccgtgtgt 


gtgtgtgtgt 


gtgtgtgtgt 


gtgtgtgtgt 


7800 




gtgtgtgtgt 


ttgtgtgtat 


gtgtgtgtgt 


gtgtttgtgt 


gtatgtgtgt 


cagtgggaat 


7860 




ggctcatagt 


ctgcaggaag .gtgggcagga 


aggaataagc 


tgtaggctga 


ggcagtgtgg 


7920 


25 


gatgcaggga 


gagaggagag 


gagggatacc 


agagaaggaa 


attaagggag 


ctacaagagg 


7980 




gcattgttgg 


ggtgtgtgtg 


tgtgtgtgtt 


gtttatattt 


gtattggaaa 


tacattcttt 


8040 




taaaaaatac 


ttatccattt 


atttattttt 


atgtgcacgt 


gtgtgtgcct 


gcatgagttc 


8100 




atgtgtgcca 


cgtgtgtgcg 


ggaacccttg 


gaggccacaa 


gggggcatct 


gatcccctgg 


8160 




aactggagtt 


ggaggaggtt 


gtgagtcccc 


tgacatgttt 


gctgggaact 


gaaccccggt 


8220 


30 


cccatgcaag 


agcaggaagt 


gcagttatct 


gctgagccat 


ctctccagtc 


ctgaaatcca 


8280 




ttctcttaaa 


atacscgtgg 


cagagacatg 


atgggattta 


cgtatggatt 


taatgtggcg 


8340 




gtcattaagt 


tccggcacag 


gcaagcacct 


gtaaagccat 


caccacaacc 


gcaacagtga 


8400 




atgtgaccat 


cacccccatg 


ttcttcatgt 


cccctgtccc 


ctccatcctc 


cattctcaag 


8460 




cacctcttgc 


tctgcctctg 


tcgctggaga 


acagtgtgca 


tctgcacact 


cttatgtcag 


8520 


35 


tgaagtcaca 


cagcctgcac 


cccttcctgg 


tctgagtatt 


tgggttctga 


ctctgctatc ' 


8580 




acacactact 


gtactgcatt 


ctctcgctct 


ctttttttaa 


acatattttt 


atttgtttgt 


0640 
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gtgtatgcac 


atgtgccaca 


tgtgtacaga 


tactatggag 


gccagaagag 


gccatggccg 


8700 




tccctggagc- 


tggagttaca 


ggcagcgtgt 


gagccgcctg 


gtgtgggtgc 


tgggaaccaa 


8760 




acttgaatct 


aaagcaagca 


cttttaactg 


ctgaggcagc 


tctcagtacc 


cttcttcatt 


8820 




tctccgcctg 


ggttccattg 


tatggacaca 


tgtagctaga 


atatcttgct 


tatctaatta 


8880 


5 


tgtacattgt 


tttgtgctaa 


gagagagtaa 


tgctctatag 


cctgagctgg 


cctcaacctt 


8940 




gccatcctcc 


tgcctcagcc 


tcctcctcct 


gagtgctagg 


atgacaggcg 


agtggtaact 


9000 




tacatggttt 


catgttttgt 


tcaagactga 


aggataacat 


tcatacagag 


aaggtctggg 


9060 




tcacaaagtg 


tgcacttcac 


tgaatggcac 


aacccgtgat 


caagaaacaa 


aactcagggg 


9120 




ctggagagat 


ggcactgact 


gctcttccag 


aggtccggag 


ttcaattccc 


agcaaccaca 


9180 


10 


tggtggctca 


cagccatcta 


taacgagatc 


tgacgccctc 


ttctggtgtg 


tctgaagaca 


9240 




gctacagtgt 


actcacataa 


aataaataaa 


tctttaaaac 


acacacacac 


acacaattac: 


9300 




caccccagaa 


agcccactcc 


atgttcccte 


ccacgtctct 


gcctacagta 


ctcccaggtn 


9360 




accactgttc 


aggcttctaa 


caacctggtt 


tacttgggcc 


tcttttctgc 


tctgtggagc 


9420 




cacacatttg 


tgtgcctcat 


acacgttctt 


tctagtaagt 


tgcatattac 


tctgcgttt- 


9480 


15 


tacatgtatt 


tatttattgt 


agttgtgtgt 


gcgtgtgggc 


ccatgcatgg 


cacagtgtg- 


9540 




ggggatgtca 


gagtattgtg 


aacaggggac 


agttcttttc 


ttcaatcatg 


tgggt cccag 


9600 




aggttgaact 


caggtcatca 


tgtgtcgcag 


caaatgcctt 


tacccactga 


gacatctcca 


9660 




tattcttttt 


ttttcccctg 


aggtgggggc 


ttgttccata 


gcccaaactg 


gctttgcacn 


9720 




tgcagttcaa 


agtgactccc 


tgtctccacc 


tcttagagta 


ttggaattac 


gatgtgtact 


9780 


20 


accacacctg 


actggatcat 


taattctttg 


atgggggcgg 


ggaagcgcac 


atgctgcagg 


9840 




tgaagggatg 


actggactgg 


acatgagcgt 


ggaagccaga 


gaacagcttc 


agtctaargc 


9900 




tctcccaact 


gagctatttc 


ggtttgccag 


agaacaactt 


acagaaagtt 


ctcagtgcca 


9960 




tgtggattcg 


S'ggttggagt 


tcaactcatc 


agcttgacat 


tggctcctct 


acccactgag 


10020 




ccttctcact 


actctctacc 


tagatcatta 


attctttttt 


aaaaagactt 


attagggggc 


10080 


25 


tggagagatg 


gctcagccgt 


taagagcacc 


gaatgccctt 


ccagaggtcc 


tgagttcaat 


10140 




tcccagcatg 


ccattgctgg 


gcagtagggg 


gcgcaggtgt 


tcaacgtgag 


tagctgttgc 


10200 




cagttttccg 


cggtggagaa 


cctcttgaca 


ccctgctgtc 


cctggtcatt 


ctgggtgggt 


10260 




gcatggtgat 


atgcttgttg 


tatggaagac 


tttgactgtt 


acagtgaagt 


tgggcttcca 


10320 




cagttaccac 


gtctcccctg 


ttccttgcag 


gccgggtgct 


tgtccactgc 


cgcgagggct 


10380 


30 


acagccgctc 


cccaacgcta 


gttatcgcct 


acctcatgat 


gcggcagaag 


atggacgtca 


10440 




agtctgctct 


gagtactgtg 


aggcagaatc 


gtgagatcgg 


ccccaacgat 


ggcttcctgg 


10500 




cccaactctg 


ecagctcaat 


gacagactag 


ccaaggaggg 


caaggtgaaa 


ctccagggtg 


10560 




cccacagcct 


ctt~ tgcaga 


ggcctgactg 


ggagggccct 


ggcagccatg 


tttaggaaac 


10620 




acagtatacc 


cactccctgc 


accaccagac 


acgtgcccac 


atctgtccca 


ctctggtcct 


10680 


35 


cgggggccac 


tccaccctca 


gggagcacat 


gaagaagctc 


cccaagaagt 


tctgctcctt 


10740 




agccatcctt 


tcctgtaatt 


tatgtctctc 


cctgaggtga 


ggrtcaggtt 


tatgtccctg 


10800 
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tctgnggcat agatacatct cagtgaccca 
gggacacggg cactcttcat gacccctccc 
ccacgagcct ggtaaaggaa ctatgcaaac 
ggtcctcaca gcccgacacg ccctgctgag 
agtggagata gattagtgac tagatttcca 
aattatttcc ttacaattaa agatactaca 
tcccagaguc tatct tgatt ctttggaatg 
atctatatct gctgcttcag agccttccct 
agagcgatgc cttgggcagg gtttccccct 
attacaaaca aacaaacaaa .caaacaaaca 
tttatcctga attatagtgt tggtgagttc 
gttgctattc taagaataat taggaggagg 
gtgtgtgcac gggtgcatat gttggaaggg 
tgaaaggccc ctctgctcac cctggccatt 
tgtgcaggat cctgaaactg actcgctgca 
tggagagaga gagagcaaag aaagaaacag 
tgaacctcgc tggaccttgt atgtgtgcac 
ggacttcacg ttctcattat ttgtatgtct 
cceagcaccc ggcacatagg cgtctcataa 
gagtaccccg tgtatagaga gagttgttgt 
tgcctgcctg taggatgtct tacaggagtt 
atatcaggga tttttgttga atatttcaaa 
tcatggttaa ggtaaggaac atgccttttc 
acctgtctr.a ggatgtcact .cccagggtaa 
tgcagcctgg acggtcattg tcccctattc 
ggctggtcat cctttttttt tttttttttt 
agccaagttg tgtacctcag tctttagttt 
catttcaaaa taacactgta gagttgacag 
gagaggaaag aacaaaacaa aacaaaacac 
catgattgca atgtctacag gcccagttca 
ggtcaaggat agcatggtct acgtatcgag 
taggttctgg attttgggct' ttggttttcg 
tcctggaact cgcGCtgtag accaggctgg 
ctttgagggc tgggacgaat gccaccactg 
agttcaagat aatnaagagt tgccagctcg 
gctgcttgag gctgtGcctg gcttggaccc 



gggtgggagg gctatcaggg tgcatggccc 
ccacctgggt tcttcctgtg tggtccagaa 
acaggccctg acctccccat gtctgttcct 
gcagacgaat gacattaagt tccgaagcag 
aaaagaagga aaaaaaaggc tgcattttaa 
taggggccct tgggtaagca aatccatttt 
tctaaagtgt gccttgccag agagcttacg 
gaggatggct ctgttccttc gcttgccaga 
tttcagaata cagggtgtaa agtccagcct 
aaggacctcc atttggagaa ttgcaaggat 
aagtcancac gccaagtgct tgccatcctg 
aacctagcca attgcagctc atgtccgtgg 
gtgcctgtcc ccttggggac agaaggaaaa 
tacgggaggc tctgctggtt ccacggtgtc 
cagaaacgag acttggcggc accatgagaa 
cctttaaaag aactttctaa gggtggtttt 
atttgccaga gattgaacat aatcctcttg 
ccggggtcac gcagagccgt cagccaccac 
aagcccattt tatgagaacc agagctgttt 
cgtggggcac ccggatccca gcagcctggt 
tgcagagaaa cc ttccttgg agggaaagaa 
ttcagcttta agtgtaagac tcagcagtgt 
cagagctgct gcaagaggca ggagaagcag 
agacctctga tcacagcagg agcagagctg 
tgtgtgacca cagcaaccct ggtcacatag 
tcccccttcg gcccagaatg aagtgaccat 
ccaagcggct ctcttgctca atacaatgtg 
aacnggttca tgtgttatga gagaggaaaa 
cacaaaccaa aaacatctgg gctagccagg 
tgagaggcag agacaggaag accgccgaaa 
actccagcca gggctacggt cccaagatcc 
agacagggtt tcrctgtgta gccctggctg 
cctcaaactt agagatctgc ctgactccgc 
cccaactaag attccactaa aaaaaaaaaa 
t’aaagctaa gtagaagcag tctcaggccc 
gaaatccgcc cccaacagtg tccaagtgca 
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catgactttg 
acacagtctc 
gagggtgggt 
gacagtcact 
agattcctgg 
gtttataaaa 
cgtggctgcc 
aagtggtgga 
ccactttcta 
tctcaatctt 
attgtcttga 
gctcttccgg 
acccgccacc 
gggtgccttt 
gaagcactga 
taattct ttt 
tgtattccat 
gaagaggctc 
tgggaggtcc 
taacgcactc- 
tgtttccttt 
ttgttagttt 
cctgcccctg 
cagtacagca 
gctaagagcg 
gtggctccag 
atacccacat 
ctagcccttg 
ttctggctaa 
atggaggaag 
aggacactag 
ttatgctggg 
attttcccaa 
ctgagagctg 
ggctgtggtg 
gctagcctgt 



agccatctcc 

tctttgtcta 

ttccctcccc 

agcacctact 

ctttccatta 

ctagctacta 

cgtgagtagg 

ataggaatta 

tgacttataa 

ctaaagtctg 

gagcacagac 

gctccagcac 

cccaagtggg 

ccaccttaag 

tcttataata 

acaggctttg 

tttcctctgc 

agagggcagg 

tgccatccgg 

taaagttgta 

tgtgtgtttg 

tgaggtagga 

cccctgtccc 

tttttctaac 

cacactgctc 

gtccagagga 

acagatacac 

gaggtacgac 

cgtaagactt 

agggatgagt 

aggaggataa 

tgaaaagaga 

attgaaaaga 

gggtgagtga 

acagctacct 

ggtgcatgag 



agagaaggaa 
ggtaacacat 
ctgcctcttc 
ggaaacctct 
gggctgaaag 
ttcttcaggt 
gggtggggtt 
agggtgttcc 
acatccaggt 
cctgcctttt 
taacacttac 
tcttaagtta 
tgtggataat 
ttgcttatag 
cgaggataag 
tcctgatcgt 
cagcgttagg 
ggatggtggg 
gaggaaaagg 
accaaaata a 
ggctttttat 
tctcaggctg 
tgcctccaag 
atttaaaaac 
ttgggtagga 
tccaggacct 
acataaaaat 
tctggaaagc 
acagagacag 
agggggagca 
ataccagtca 
aggccccagg 
aaaagatgta 
gacatgtgga 
ctaatctcca 
accctgtttc 



gtgaaaattg 

ggtgacacat 

tgggttggtc 

ttgtgggaac 

tacaacggtt 

aaaataccga 

gggaatcctg 

cccccccccc 

aaaaattaca 

ccaggggtag 

caaatgaggg 

ttttaagaat 

gccatggcca 

tatttaagat 

agattttctc 

agcatagaga 

ttaactccgt 

gtgaggcaga 

aaagtttagc 

atgtcttaca 

gtgtgcttta 

gccttgaact 

tgctaggact 

aatcacctag 

cctgagttta 

ctggcctcca 

aaaatgaagc 

cggcatactg 

aaaagaactc 

cggggaactt 

ttgcacccac 

tattggctgc 

aatcaaggtt 

caactccatc 

cctccgggag 

aaaaacttta 



tggctcccca 

agcattgaac 

accccatagg 

atgaagaaag 

cttcgttggc 

tgttgtggaa 

gatagtgttc 

aacctcttcc 

aacataaaaa 

gtctgtttct 

aactcttggc 

tctcacttgg 

gcagggggca 

gctaaatgtt 

acaggaaatt 

gaatagctgg 

aaaaagtgat 

gcactgtcac 

ctctagtcta 

ttacaaagac 

taactgctgt 

tctgatcgcc 

aaaagcacat 

gggctggaga 

gttcccagaa 

tgggcatctg 

ctttaaaaac 

tgtaagtcca 

a 999tgtgct 

gggcagtgaa 

tactggacaa 

attggctgca 

agatgagtgg 

aaaaagcgac 

gtgatcaagg 

ataaagaaat 



gtcgattggg 

tctccactct 

acagccacag 

agcctttggg 

tttgcctcgt 

aagccaaccc 

tatccatgga 

tcagacccag 

tggtttctct 

ttgctgttct 

ccatactaag 

cctttagcac 

ctgttgaggc 

ttaatcaaga 

gtctttttca 

atatttaact 

tcagtggacc 

ctgccaggca 

ccaccaatgt 

gtctgttttg 

ggtggtgctg 

tgcccctgcc 

gccaccacac 

gagggttcca 

cctatactgg 

ctcttagcac 

ctcctaaaac 

tctcatggtg 

gggggttggg 

aattctttgc 

ctccagggaa 

tttgcgtaac 

ttgctgtgag 

agaaagaacg 

ttagccctca 

aatgaaaaaa 
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gacatcaggg 


cagatccttg 


gggccaaagg 




tagaagcgga 


tgcatgagca 


cgtgccgcag 




atggatgtga 


gtg-gtcggc 


gtcggcgcac 




atctttggtg 


agctgtggag 


gggaaatggg 


5 


ttcaagaact 


gtctattaca 


attatctcaa 




aaaaaaccta 


tccaggtgtg 


gtggtgtgca 




gcaagaggat 


ggcgagtttg 


aaggtatctg 




ccaaaccaaa 


cagcaaaccc 


attatgtcac 




gagagcatgg 


ggtgggggtg 


ggggtggggg 


10 


atccactgag 


accctggggc 


ttgactgcag 




gttgagtaaa 


agcatcgatt 


actgacttaa 




caacaaaagc 


caaaccaagg 


ggctggtgag 




tcttccgaag 


gtccagagtt 


caaatcccag 




acgagatatg 


atgccctctt 


ctggtgtgtc 


15 


taaataaatc 


ttaaaaaaaa 


aaaaaaaaaa 




aacagaaggc 


aggcacgacg 


gcaggcacca 




catgggccga 


ggagggtcca 


gagagatagg 




tttcttgttg 


acactacttc 


aattacagac 




ggccactctc 


tgctcgcttg 


atttttcctg 


20 


aggacagatc 


gcatcattaa 


ggtggccagc 




gaaccattat 


ggtgctcaca 


gaatcccggg 




caataggaaa 


gctttctggc 


ccagacaggg 




aggagactgg 


aggtcacaga 


caaaagggcc 




agagatagat 


cacccccaac 


aatggccaca 


25 


cttaggaagc 


aggtatcaga 


gtccccttcc 




gtcagagcag 


ctgcattgat 


gtgtgggtga 




cgccacagat 


ggaccggcca 


cttacaagtc 




tgcaggtgga 


cgacactgat 


tcattaccca 




acagcctcct 


ncccagtcct 


cctccagggc 


30 


cagcttccgc 


cagcccctcc 


tccctttgca 




ctccctgtgg 


catggccctc 


ctgctactgc 




atagacccga 


gaCggctttc 


tgatttatat 




aggacccaga 


gccgcctgcg 


ataccataag 




tgctgtacaa 


gcatgaagac 


atgagttcga 


35 


tctzaacctca 


gagttgaggg 


gaaaggcagg 




cagcctaacc 


taaacgtctc 


agtcagagat 



cggacaggcg agtctcgtgg taaggtcgtg 
gcatcatgag agagccctag gtaagtaagg 
tgcacgtcct ggctgtggtg ctggactggc 
tagggagatc ataaaatccc tccgaattat 
aatattaaaa aaaaagaaga attaaaaaac 
cctatagcca cgggcacttg gaaagctgga 
ggyctgcaca gcaagaccgt cgtccccaaa 
acaagagtgt ttatagtgag cggcctcgcl 
acagaaatan ctaaactgca gtcaataggg 
cttaaccttg ggaaatgata agggttttgt 
cctcaaatga agaaaaagaa aaaaagaaaa 
atggctcagt gggtaagagc acccgactgc 
caaccacatg gtggctcaca accatctgta 
tgaagacagc tacagtgtac ttacatataa 
aaaagccaaa ccgagcaaac caggccccca 
cgagccatcc tgtgaaaagg cagggctacc 
ctggtaagct cagtttctct gtatacccct 
aaaataacaa ataaacaaaa tctagagcct 
ttacgtccag caggtggcgg aagtgttcca 
ataatctccc atcageaggt ggtgctgtga 
cccaggagct gccctctccc aagtctggag 
ttaacagtcc acattccaga gcaggggaaa 
agcttctaac aacttcacag ctctggtagg 
gctggttttg tccgccccga aggaaactga 
tgaggggact tctgtctgcc ttgtaaagct 
cagaagatga aaaggaggac ccaggcagat 
gaggcaggtg gcagagcctt gcagaagctc 
gttagcatac cacagcgcgc taggcggacc 
tggggagtcc tccaaccttc tgtctcagtg 
ccncaggtgt gaaccctccc tcctctcctt 
aggctgagca ttggatttct ttgtgcttag 
atatatatcc atcccttgga tcttacatct 
a 99ccgggga gatgatatgg taagagcgct 
atccccagca accatgtgga aaaataacct 
tggattctgg gggcttactg gccagctagc 
cctgtctcag ggaataactt gggagaatga 
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ctgagaaaga cacc tcctca ggcctcccat gcacccacac agacacacgg ggggggggta 
atgtaataag ctaagaaata acgagggaaa tgattttttg ctaagaaatg aaactctgtg 
ttggccgcaa gaagcctggc cagggaagga actgcctttg gcacaccagc ctataagtca 
ccatgagttc cctggctaag aatcacatgt aatggagccc aggtccctct tgcctggtgg 
5 ttgcctctcc cactggt i:tt gaagagaaat tcaagagaga cctccttggt cagaattgta 
ggtgctgagc aatgtggagc tggggtcaat gggattcctt taaaggcatc cttcccaggg 
c tgggtcata c ttcaatagt agggtigcttg cacagcaagc gtgagaccct aggttagagt 
ccccagaatc tgcccccaac cccccaaaaa ggcatccttc tgcccctggg tgggtggggg 
gagcaaacac ctttaactaa gaccattagc tggcaggggt aacaaatgac cctgqctaga 
10 ggaatttggc caagctggat tccgecttct gtagaagccc cacttg^tcc cttcgttaag 
ctggcccaca gtttgttttg agaatgcctg aggggcccag ggagccagac aattaaaagc 
caagctcatc ntgatatctg aaaaccacag cctgactg cc ctgcccgtgg gaggtaccgg 
gagagctggc tgtgtccctg cctcaccaac gccccccccc ccaacacaca ctcctcgggt 
cacctgggag gtgccagcag caatttggaa gtttactgag cttgagaagt cttgggaggg 
15 ctgacgctaa gcacacccct tctccacccc cccccacccc acccccgtga ggaggagggt 
gaggaaacat gggaccagcc ctgctccagc cc gtccttat tggctggcat gaggcagagg 
gggctttaaa aaggcaaccg tatctaggct ggacactgga gcctgtgcta ccgagtgccc 
tcctccacct ggcagcatgc agccctcact agccccgtgc ctcatctgcc tacttgtgca 
cgctgccttc tgtgctgtgg agggccaggg gtggcaagcc ttcaggaatg atgccacaga 
20 ggtcatccca gggcttggag agtaccccga gcctcctcct gagaacaacc agaccatgaa 
ccgggcggag aatggaggca gacctcccca ccatccctat gacgccaaag gtacgggatg 
aagaagcaca ttagtggggg ggggggtccc gggaggtgac tggggtggtt ttagcatctt 
cctcagaggt ttgtgtgggt ggctagcctc tgctacatca gggcagggac acacctgcct 
ggaagaatac tagcacagca ttag^acctg gagggcagca ttgggggcct ggtagagagc 
25 acccaaggca gggtggaggc tgaggtcagc cgaagctggc attaacacgg gcatgggctt 
gtaugatggt ccagagaatc tcctcctaag gatgaggaca caggtcagat ctagctgctg 
accagtgggg aagtgatatg gtgaggctgg atgccagatg ccatccatgg ctgtactaca 
tcccacatga ccaccacatg aggtaaagaa ggccccagct tgaagatgga gaaaccgaga 
ggctcctgag ataaagtcac ctgggagtaa gaagagctga gactggaagc tggtttgatc 
30 cagatgcaag gcaaccctag attgggtttg ggtgggaacc tgaagccagg aggaatccct 
ttagttcccc cttgcccagg gtctgctcaa tgagcccaga gggctagcat taaaagaaca 
gggtttgtag gtggcacgtg acatgagggg cagccgagng aaacgtcccc tgtatgagca 
caggtggcac cacttgccct gagcttgcac ccrgacccca gcttitgcctc attcctgagg 
acagcagaaa ccgtggaggc agagccagca cagagagatg cctggggtgg gggtgggggt 
35 atcacgcacg gaactagcag caatgaatgg ggzggggtgg cagctggagg gacactccag 
agaaatgacc ttgctggtca ccatttgtgt gggaggagag ctcatcttcc agcttgccac 
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cacatgctgt ccctcctgtc 
aggagcatgc aacgcagtca 
tcggaaagcc ccaaaactag 
tgcaggggtg agtaaattaa 
gttgtgcccc cccccccccc 
ctgggatttt ttttctcctt 
tagacaggac ccacacatgg 
agatagagca tgtccaagag 
ccaacccaat ctccttcccc 
accagctcaa ggtacacaga 
tgtggtagca aaagctatag 
cgaaaggccc tctactcgcc 
tcacccttgc gggagccaga 
gcaagcagcc cagccctgcc 
gagggggttg gacgcgccgc 
acctgccctt ctaccccgct 
ctccactcct ttctccatct 
ttaggaagga ccagagccag 
ctaaactgga agagtgtttg 
gagaagtgga gatgggcgct 
ttgaggccag cctgggctac 
gccacgtagt gcccgatgta 
acagatgtgg ggaaaagcaa 
acagagctcc agtgtttcat 
atctcactgc tccccggtgc 
ggtttggaaa aagagagaag 
tcacgcatcc ctctctccgc 
cgcttcctga cagacggccc 
ggccagtgcg gccccgcgcg 
ccgaacggac cggatttccg 
ctgtgccccg ggggcgcggc 
cgcaagcgcc tcacccgctt 
gcgcggccgc agaagggtcg 
gcggagccgg agaacgccta 
attcgttttc; agtgtaaagc 
tggagttccc agcccagtag 



tcctagccag taagggatgt 
cgtttttgca gaggaagtgc 
tccttccctg ggcaaacagg 
gccagccaca gaagggtggc 
gtgaaggtgc atcctggcct 
ttatgtcata ttgatcctga 
attagttaaa agcctcccat 
aggagggcag gcatcagacc 
ggagaacaga ctctaagtca 
acaagagagt ctggtataca 
get ttgggtc agaactccga 
accgccccgc ccccacctgg 
gagccctggc atcctaggta 
tttggggcaa gttcttttct 
ctttggtcgc ttccaagtct 
cctccacagc agctgtcctg 
cctgggatac cgcccctgtc 
9tgtggctag aggctaccag 
gttagtagge acaaagcctt 
gagaagttca agaccatcca 
ataaaaaccc aatctcaaaa 
atagtggatg aagtcgttga 
ctttaagtac cctgcccaca 
ccctgggttc caaggacagg 
ctccttccta taatccatac 
ggtggaagca gcagaccagt 
agatgtgtcc gagtacagct 
atgccgcagc gccaagccgg 
gctgctgccc aacgccatcg 
ctgcatcccg gatcgctacc 
gccgcgctcg cacaaggtgc 
ccacaaccag teggagetea 
caagccgcgg cccggcgccc 
etagagegag cccgcgccta 
ctgcagccca ggccaggggt 
agaccgcagg tccttctgcc 



ggaggaaagg gccaccccaa 
ttgacctaag ggcactattc 
cctcccccac ataccacctc: 
aaggcctaca cctcccccct 
ctgcccctct ggctttggta 
caccatggaa cttttggagg 
ccatctaagc tcatggtaqg 
tagaagatat ggctgggcat 
gatccagcca cccttgagta 
gcaggtgcta aacaaatgct 
cccaagtcgc gagtgaagag 
ggtcctataa cagatcactt 
gccccccccg cccccccccc 
cagcctggac ctgtgataat 
aatgaattet tatccctacc 
atttattacc ttcaattaac 
ccagtggctg gtaaaggagc 
gcagggctgg ggatgaggag 
gggtgggatc cctagtaccg 
tccttaacta cacagccagt 
gctgccaatt ctgattctgt 
atcctggggc aacctatttt 
gatcacaaag aaagtaagtg 
gagagagaag ccagggtggg 
agattegaaa gcgcagggca 
ctggcctagg ctgcagcccc 
gccgcgagct gcactacacc 
tcaccgagtt ggtgtgctcc 
ggcgcgtgaa gtggtggcgc 
gcgcgcagcg ggtgcagctg 
gtctggtggc ctcgtgcaag 
aggaettegg gccggagacc 
9999agccaa agccaaccag 
tgcagccccc gcgcgatccg 
gccaaacttt ccagaccgtg 
cgctgcgggg gatggggagc 
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gggtggggtc cccgcgggcc aggagaggaa gcttgagtcc cagactctgc ctagccccgg 
gtgggatggg ggtctttcta ccctcgccgg acctatacag gacaaggcag tgtttccacc 
ttaaagggaa gggagtgtgg aacgaaagac ctgggactgg ttatggacgt acagtaagat 
ctactccttc cacccaaatg taaagcctgc gtgggctaga tagggtttct gaccctgacc 
5 tggccactga gtgtgatgtt gggctacgtg gttctctttt ggtacggtct tctttgtaaa 
a tagggaccg gaactctgct gagattccaa ggattggggt accccgtgta gactggtgag 
agagaggaga acaggggagg ggttagggga gagattgtgg tgggcaaccg cctagaagaa 
gctgtttgtt ggctcccagc ctcgccgcct cagaggtttg gcttccccca ctccttcctc 
tcaaatctgc ctccaaatcc atatctggga tagggaaggc cagggtccga gagatggtgg 
10 aagggccaga aatcacactc ctggcccccc gaagagcagt gtcccgcccc caactgcctt 
gucatattgt aaagggattt tctacacaac agtttaaggt cgttggagga aactgggctt 
gccagtcacc tcccatcctt gtcccttgcc aggacaccac ctcctgcctg ccacccacgg 
acacatttct gtctiagaaac agagcgtcgt cgcgctgtcc tctgagacag catatcttac 
attaaaaaga ataatacggg gggggggggc ggagggcgca agtgttatac atatgctgag 
15 aagctgtcag gcgccacagc accacccaca atctttttgt aaatcatttc cagacacctc 
ttactttctg tgtagatttt aattgttaaa aggggaggag agagagcgtt tgliaacagaa 
gcacatggag gggggggtag gggggttggg gctggcgagt ttggcgaact ttccatgtga 
gactcatcca caaagactga aagccgcgtc ttttttttta agagttcagt gacatattta 
ttttctcatt taagttattt atgccaacat ttttttcttg tagagaaagg cagtgttaat 
20 atcgctttgt gaagcacaag cgtgtgtggt tttttgtttt ttgctttttc cccgaccaga 
ggcattgtta ataaagacaa tgaatctcga gcaggaggct gtggtcttgt tttgtcaacc 
acacacaatg tctcgccact gtcatctcac tcccttccct tggtcacaag acccaaacct 
tgacaacacc tccgactgct ctctggtagc ccttgtggca anacgcgctc cctt.t.gaaaa 
gtcacattca tcctttcctt tgcaaacctg gctctcattc cccagctggg tcatcgtcat 
25 accctcaccc cagcctccct ttagctgacc actctccaca ctgtcttcca aaagcgcacg 
tttcaccgag ccagttccct ggtccaggtc atcccattgc tcctcct-gc tccagaccct 
tctcccacaa agatgttcat ctcccactcc atcaagcccc agtggccctg cggctatccc 
tgtctcttca gttagctgaa tctacttgct gacaccacat gaattccttc ccctgtctta 
aggttcatgg aactcttgcc tgcccctgaa ccttccagga ctgtcccagc gtctgatgtg 
30 tcctctctct tgtdaagccc caccccacta tttgattccc aattctagat ' ctticccttgc 
tcattccttc acgggatagt gtctcatctg gccaagtcct gcttgatatt gggataaa-g 
caaagccaag tacaattgag gaccagttca tcattgggcc aagctttttc aaaatgtgaa 
ttttacacct atagaagtgt aaaagccttc caaagcagag gcaatgcctg gctcttcctt 
caacatcagg gctcctgctt tacgggcccg gtggggtagt acattcataa acccaacact 
35 aggggtgtga aagcaagatg atcgggagtt ccaggccaac cttggctatg aggccctgtc 
tcaacctctc ctccctcccc ccagggtttt gttttgtttt gtttttttga tttgaaactg 
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caacacttta aatccagtca agtgcatctt tgcgtgaggg gaactctatc cctaatataa 
gcttccatct tgatttgtgt atgtgcacac tgggggttga acccgggcct ttgtacctgc 
cgggcaagct ccctactgct ctaaacccag ccctcactgg ctttctgttt caactcccaa 
tgaatt cccc taaatgaatt atcaatatca tgtctttgaa aaacaccatt gagtgctgct 
5 ggtgtccctg tggttccaga ttccaggaag gacttttcag ggaatccagg catcctgaag 
aatgtcttag agcaggaggc catggagacc ttggccagcc ccacaaggca gtgtggcgca 
gagggtgagg atggaggcag gcttgcaatt gaagctgaga cagggtactc aggattaaaa 
agcttccccc aaaacaattc caagatcagt tcctggtact tgcacctgtt cagctacgca 
gagcccagtg ggcataggtg aagacaccgg ttgtactgtc atgtactaac tgtgcttcag 
10 agccggcaga gacaaataat gttatggtga ccccagggga cagtgattcc agaaggaa^a 
cagaagagag tgctgctaga ggctgcctga aggagaaggg gtcccagact ctctaagcaa 
agactccact cacataaaga cacaggctga gcagagctgg ccgtggatgc agggagccca 
cccaccatcc tttagcatgc ccttgtattc ccatcacatg ccagggatga ggggcatcag 
agagtccaag tgatgcccaa acccaaacac acctaggact tgcttcctgg gacagacaga 
15 tgcaggagag actaggttgg gctgtgatcc cattaccaca aagagggaaa aaacaaaaaa 
caaacaaaca aacaaaaaaa aacaaaacaa aacaaaaaaa aacccaaggt ccaaattgta 
ggtcaggtta gagtttattt atggaaagtt atattccacc tccatggggt ctacaaggct 
ggcgcccatc agaaagaaca aacaacaggc tgatctggga ggggtggtac tctatggcag 
ggagcacgtg tgcttggggt acagccagac acggggcttg tattaatcac agggcttgta 
20 ttaataggct gagagtcaag cagacagaga gacagaagga aacacacaca cacacacaca 
cacacacaca cacacacaca catgcacaca ccactcactt ctcacrcgaa gagcccctac 
ctacattcta agaacaaacc attcctcctc ataaaggaga caaagctgca gaaacccaaa 
agagccacag ggtccccact ctctttgaea tgacttggac ttgttgcagg gaagacagag 
gggtctgcag aggcttcctg ggtgacccsg agccacagac actgaaatct ggtgctgaga 
25 cctgtataaa ccctcttcca caggttccct gaaaggagcc cacatncccc aaccctgtct 
cctgaccact gaggatgaga gcacttgggc cttccccatt cttggagrgc accctggttt 
ccccatctga gggcacatga ggtctcaggt cttgggaaag ttccacaagt attgaaagcg 
ttcttgtttt gtttgtgatt taatttaggt gtatgagtgc ttttgctcga atatatgcct 
gtgtagcatt tacaagcctg gtgcctgagg agatcagaag atggcatcag ataccctgga 
30 actggacctg cagacagtta tgagccactg tgtgggtgct aggaacagaa cctggatcct 
ccggaagagc agacagccag cgctcttagc cactaagcca tcactgaggt cctttctgtg 
gctaaagaga caggagacaa aggagagttc cttttagtca ataggaccat gaatgttcct 
cgtaacgtga gactagggca gggtgatccc ccagtgacac cgatggcccc gtgtagttat 
tagcagccct agtcttattc cttaataagt cccagtttgg ggcaggagat atgtattccc 
35 tgctttgaag tggctgaggt ccagttatct acttccaagt acttgttcct ctttctggag 
ttggggaagc cccctgcctg cctgcaaatg tgtccattct tcaacctcag acaagatcac 
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tttccctgag 

ctttcgttgg 

cggatctctg 

ctacacagag 

caaaaacgaa 

tttttgtccc 

tcatccggac 

tctggggctg 

agcaaggagt 

ctgatgtgca 

tccacagacc 

tgggctgggc 

tgggattaca 

ccagatctga 

ggattctact 

gcagttctta 

agcgaagcag 

agaaagggtc 

atacatggaa 

caggttcacc 

tactgtcagt 

agctccctga 

tcacattctc 

atatgattcc 

acctggagct 

gggaggaaga 

ccagtctctc 

agaaaggaga 

actgagtacc 

gtgggactcc 

gatctgaatg 

aagctacctt 

tctctgtatc 

agaaatcccc 

ggcccaaagc 

gttcattgct 



cagtcaggcc 

gtggaggtag 

tgagttcaca 

aaaccctgtc 

caaacagaaa 

tttgccattt 

acacccagaa 

ggtggattgg 

at 9tggtgca 

ggtatgggca 

cctcccccct: 

tggctagctt 

gtcatatatg 

gtgcttgcaa 

ttctattaaa 

ggcctgctga 

aggacaaaga 

agggaaggaa 

cctggtgtac 

acactgggaa 

catgcctgaa 

agaaaggcaa 

tgtatgccta 

tccttttcct 

ggcagtgggc 

tgaaggaccc 

ctacagatgg 

gggtgaggca 

agctgcgtgg 

tagcccagtg 

tgqcccagcc 

tttgtttgtt 

accctagccg 

ctgcctctgc 

tactttaaga 

gtagaattgg 



agtccaaagc 

cacttgcctt 

gccagcctgg 

tcgaaaaaaa 

aacaagccag 

tagactaaaa 

ccagatgtat 

ttagatggca 

tagcaaacga 

catgcaagca 

tttaacatgg 

gtaggtccca 

agcacacctg 

ggcaatcggt 

gtatttctat 

gagtcaagtg 

aatgaaaact 

atgaaccaga 

atgttatctg 

acagaagcca 

gaaatggaag 

gaagctgggc 

gcagggcagt 

acagtcatct 

agctgcagtg 

agaggaccac 

gagacagagg 

gagaccaaag. 

cagacagcca 

ctggcagagg 

ctagtttcct 

tgttttggtt 

tcctggaact 

ctcctaagtg 

gagagagagg 

agtcttcata 



ccttcaattt 

gaatcccagc 

tctacggagt 

caaaaacaaa 

agtgtttgtc 

gactcgggaa 

ggaagatggc 

tgggctgggt 

ggaagtttgc 

gaagccaagg 

gcatctctca 

gggatctgca 

gcttttttat 

tgaatgactg 

taaatcaatg 

gggagtgaga 

tgggattcga 

tgaatagagg 

catggggttt 

aaaagaagag 

caattaacga 

tgtgggcact 

actggagact 

ggggcccctg 

atagatgtct 

cgacctctgc 

cgagagatga 

gagggaaaca 

atgccaaggc 

ggagtgctga 

tccagttgct 

tggttttgtt 

cactctgtag 

ctggaattaa 

aatgtataag 

ttccaggtaa 



agctttcata 

attaagaagg 

gagttccaag 

agaaataaag 

cccgtatttt 

agcsggtctc 

taatgugctg 

gtggttacga 

acagaacaac 

gacagcctta 

ttggcctgga 

tatctctgcc 

gtgggttctg 

cttcatctcc 

agcccctgcc 

gcaagcctcg 

ggctcgggat 

caggaagggt 

gcattgcaat 

taggtggtgt 

tgcgccgcaa 

gaagggagct 

gagacttgac 

agcttcgtcc 

gcaagaaaga 

tgcccgacaa 

atggtcaggg 

cttgtgctct 

tcggctgatc 

atggcgcatg 

gggataaagc 

tggtttttcg 

accaggctgg 

aggcctgcgc 

tattataact 

tctcccacag 



aggaacaccc 

cagagacagt 

acagccaggc 

aaaaagaaaa 

attaaccata 

tctctgttnc 

cagttgcaca 

tgactgcagg 

actgtgcgta 

gggtagtgtt 

gcttgccaac 

tccctagtgc 

ggctttgaac 

ccagaccctg 

cctgcactca 

agaccccatc 

atggagatac 

agggccctgc 

ggctcttcag 

tggagtcaga 

ttaggatatt 

ttgaatgatg 

ttgtgtgtcc 

ttgtccaaga 

tctgaaaaga 

agctgcagga 

gaggagtcag 

acagctactg 

atggcacctc 

gtttggatat 

accctgacca 

aggcagggtt 

cctcgaactc 

caccactgcc 

ccaggttiata 

acatgccaca 



25980 

26040 

26100 

26160 

26220 

26280 

26340 

26400 

26460 

26520 

26580 

26640 

26700 

26760 

26820 

26880 

26940 

27000 

27060 

27120 

27180 

27240 

27300 

27360 

27420 

27480 

27540 

27600 

27660 

27720 

27780 

27840 

27900 

27960 

28020 

28080 



WO 00/32773 



PCT/US99/27990 



29 

aaacaacctg ttctacgaaa tctctcatgg actcccttcc ccagtaattc taaactgtgc 28140 

caaatctaca agaaatagtg acagtcacag tctctaacgt tttgggcatg agtctgaagt 28200 

ctcattgcta agtactggga agatgaaaac tttacctagt gtcagcattt ggagcagagc 28260 

ctttgggatt tgagatggtc ttttgcagag ctcctaatgg ctacatggag agagggggcc 28320 

5 tgggagagac ccacacacct tttgctgcct tatgtcacct gacctgctcc ttgggaagct 28380 

ctagcaagaa ggccttccct ggatcaccca ccaccttgca cctccagaac tcagagccaa 28440 

attaaacttt cttgctactg tcgtcaaagc acagtcggtc tgggttgtat cactgtcaat 28500 

999 aaaca 9 a cttgcccgga tggataactc gtacattgca taatgtctag aaatgaaaag 28560 

tcctatagag aaaaagaaaa ttagctggca cacagataga ggccctggag gaggctggct 28620 

10 ttgtcctccc cgaggaggtg gcgagtaagg tgtaaatgtt catggatgta aatgggccca 28680 

tatatgaggg tctggggtaa caagaaggcc tgtgaatata aagcactgaa ggtatgtcta 28740 

gtctggagaa ggtcactaca gagagttctc caactcagtg cccatacaca cacacacaca 28800 

cacacacaca cacacacaca cacacacaca ccacaaagaa aaaaaggaag aaaaatctqa 28860 

gagcaagtac agtscttaaa attgtgtgat tgtgtgtgtg actctgatgt cacatgctca 28920 

15 tcttgcccta tgagttgaaa accaaatggc ccctgagagg cataacaacc acactgttgg 28980 

ctgtgtgctc acgtttttct taaagcgtct gtctggtttg ctgctagcat caggcagact 29040 

tgcagcagac tacatatgct cagccctgaa gtccttctag ggtgcatgtc tcttcagaat 29100 

ttcagaaagt catctgtggc tccaggaccg cctgcactct ccctctgccg cgaggctgca 29160 

gactctaggc tggggtggaa gcaacgctta cctctgggac aagtataaca tgttagcttt 29220 

20 tctttccctc tgtggctcca acctggacat. aaaatagatg caagctgtgt aataaatatt 29280 

tcctcccgtc cacttagttc tcaacaataa c-tactctgag agcacttatt aataggtggc 29340 

ttagacataa gctttggctc attcccccac tagctcttac ttctttsact ctttcaaacc 29400 

atnctgtgtc ttccacatgg ttagttacct ctccttccat cctggttcgc ttcttccttc 29460 

gagt cgccct cagtgtctct aggtgatgct tgtaagatat tctttctaca aagctgagag • 29520 

25 tggtggcact ctgggagttc aaagccagcc tgatctacac agcaagctcc aggatatcca 29580 

gggcaatgtt gggaaaacct ttctcaaaca aaaagagggg ttcagttgtc aggaqgagac 29640 

ccatgggtta agaagtctag acgagccatg gtgatgcata cctttcatcc aagcactcag 29700 

gaggcaaaga aaggtgaaac tctttgactt tgaggccagc taggttacat agtgaCaccc 29760 

tgcttagtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtaatt taaaagtcta 29820 

30 aaaatgcatt cttctaaaaa tatgtataag tatttgcctg cacacatgta tgtatgtacg 29880 

tataccatgt gtgtgcctgg tgctgaagga ctaggcatag actccctaga actagagtca 29940 

tagacagttg tgacactccc caacccccca ccatgtgggt gcttgaagct aaactcctgt 30000 

cctttgtaaa gcagcaggtg tctatgaacc ccgaaccatc tctccagtct ccagatgtgc 30060 

attctcaaag aggagtcctt catatttccc taaaccgaac atccttatca gtgagcatcc 30120 

35 tcgagtcacc aaagctactg caaaccctct tagggaacat tcactattca cttctacttg ■ 30180 

gctcatgaaa cttaagtaca cacacacaaa cacacacaca cacacagagt catgcactca 30240 
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caaaagcatg 


catgtacacc 


attcttatta 


gactatgctt 


tgctaaaaga 


ctttcctaga 


30300 




tactttaaaa 


catcacttct 


gccttttggt 


gggcaggttc 


caagattggt 


actggcgtac 


30360 




tggaaactga 


acaaggtaga 


gatctagaaa 


tcacagcagg 


tcagaagggc 


cagcctgtac 


30420 




aagagagagt 


tccacacctt 


ccaggaacac 


tgagcagggg 


gctgggacct 


tgcctctcag 


30480 


5 


cccaagaaac 


tagtgcgttt 


cctgtatgca 


tgcctctcag 


agattccata 


agatctgcct 


30540 




tctgccataa 


gatctcctgc 


atccagacaa 


gcctagggga 


agttgagagg 


ctgcctgagt 


30600 




ctctcccaca 


ggccccttct 


tgcctggcag 


tattttttta 


tctcgaggag 


aggaatcagg 


30660 




Stgggaatga 


tcaaatacaa 


ttatcaagga 


aaaagtaaaa 


aacacatata 


tatatatatt 


30720 




aactgatcta 


gggagctggc 


tcagcagtta 


agagttctgg 


ctgcccttgc 


ttcagatctt 


30780 


10 


gctttgattc 


ccagcaccca 


catgatggct 


ttcaactgta 


tctctgcttc 


caggggatcc 


30840 




aacagcctct 


tctgacctcc 


atagacaaga 


cctagtcctc 


tgcaagagca 


ccaaatgctc 


30900 




ttatctgttg 


atccatctct 


ctagcctcat 


gccagatcat 


ttaaaactac 


tggacactgt 


30960 




cccattttac 


gaagatgtca 


ccgcccagtc 


atttgccatg 


agtggatatt 


tcgattcttt 


31020 




ctatgttctc 


acccttgcaa 


tttataagaa 


agatatctgc 


atttgtctcc 


tgagagaaca 


31080 


15 


aagggtggag 


ggctactgag 


atggctctag 


gggtaaaggt 


gcttgccaca 


aaatctgaca 


31140 




acttaagttt 


ggtcttggaa 


tccacatggt 


ggagagagag 


aagagattcc 


cgtaagttgt 


31200 




cctcaaactt 


cccacacatg 


tgctgtggcr 


tatgtgtaac 


cccaataagt 


aaagatagtt 


31260 




ttaaacacta 


cataaggtag 


ggtttcttca 


rgaccccaag 


gaatgatgcc 


cctgatagag 


31320 




cttatgctga 


aaccccatct 


ccattgtgcc 


atctggaaag 


agacaattgc 


atcccggaaa 


31380 


20 


cagaatcttc 


atgaatggat 


taatgagcta 


Ltaagaaagt 


ggcttggtta 


ttgcacatgc 


31440 




tggcggcgta 


atgacctcca 


ccatgatgtt 


atccagcatg 


aaggtcctca 


ccagaagtca 


31500 




tacaaatctt 


cttaggcttc 


cagagtcgtg 


agcaaaaaaa 


gcacacctct 


aaataaatta 


31560 




actagcctca 


ggtagttaac 


caccgaaaat 


gaaccaaggc 


agttctaata 


caaaaccact 


31620 




tcccttccct 


gttcaaacca 


cagtgcccta 


ttatctaaaa 


gataaacctc 


aagccaagct 


31680 


25 


tttaggttgc 


cagtatttat 


gtaacaacaa 


ggcccgttga 


cacacatctg 


taactcctag 


31740 




tactgggcct 


caggggcaga 


gacaggtgga 


gccctggagt 


ttgaattcca 


ggttctgtga 


31800 




gaaactctgt 


ctgaaaagac 


aatatggtga 


gtgacccggg 


aggatatctg 


atattgactt 


31860 




ctggccaaca 


cacagccatc 


tctgcacatc 


tgtagttgca 


agccttttgc 


actaagtttg 


31920 




gccagagtca 


gagtttgcaa 


gtgtttgtgg 


actgaatgca 


cgtgttgctg 


gtgatctaca 


31980 


30 


aagtcaccct 


cctcctcaag 


ctagcagcac 


tggcttcggc 


cagctgctca 


ttcaagcctc 


32040 




tttgcagagt 


catcacgggg 


atcggggagc 


agggcccc.tc 


cctagaacac 


caagcctgtg 


32100 




gttgcttatt 


caggacatta 


ttgagggcca 


agatgacaga 


taactctatc 


acttggccaa 


32160 




cagtcgggtg 


ttgcggtgtt 


aggttatttc 


tgtgtctgca 


gaaaacagtg 


caacctggac 


32220 




aaaagaaata 


aatgatatca 


tttttcattc 


aggcaactag 


attccgtggt 


acaaaaggct 


32280 


35 


ccctggggaa 


cgaggccggg 


acagcgcggc 


tcctyagtcg 


ctatttccgt 


ctgtcaactt 


32340 




ctctaatctc 


ttgatttcct 


ccctctgtct 


gtttccttcc 


tcttgctggg 


gcccagtgga 


32400 
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gtctgtgtac 
gggggaagct 
aatttaagat 
ccccgccctg 
aaaatgtggc 
ctctgccagc 
ccaacttttg 
agggaaggct 
ttagttagct 
ggtttccgcc 
caggtgggcc 
atatgagctc 
tgagaactgg 
gcagagatct 
ctgtctcaag 
acacacatgc 
caaatacata 
ctcctcagtc 
cttcatcttg 
tccgtcattc 
gataaagaca 
ttctgttcac 
cttcaccaca 
tccaaccagc 
tttccctcct 
aacgat taca 
gctgggatga 
ccgaagctga 
cctaccacat 
tggtggtgct 
ttzttctgctt 
gcagaaagat 
atagttactg 
gagactctaa 
gaatggaccc 
atggtctgga 



tcacagggag 

gtcggcccag 

aatgagtctc 

tcaatctggc 

tggaccgtgt 

cccggtgcct 

atgatggtgt 

cttcagggag 

tccacctggt 

ctcatccacc 

attggtcact 

actgagtcct 

agttcaattc 

taaagctcct 

aaacaagatg 

acatacatcc 

cataaaaaaa 

tcctggctac 

actcctctgt 

ataaactccc 

cacactacaa 

ccggccagct 

gctcctccat 

acaggrccag 

gaatctacca 

aggaagccaa 

attcacattt 

gtccagaaag 

cagctgcccc 

ggtggtggtg 

ttacaaaact 

gactcacgcc 

tcaaaagtaa 

aatccacatt 

agatagaagg 

cagaagcatc 



gagggtggca 

tgactttttc 

ctcattcacg 

taagtaagac 

gccggcacga 

tttcctttcg 

gcgccgggtg 

tcacctagcc 

cccttatccg 

ttgccctttt 

ccgctaccac 

gcgggatcgc 

ccagcacatg 

ggccagacag 

gaacaccaaa 

acacacaggc 

taaatacata 

gctcttgtca 

actttgcatg 

tctaaatttc 

agtcaccgtg 

aagtagtcca 

gctacccagc 

tctggcttct 

ccttctttct 

tgaaattagc 

ccaggccttt 

ctgaacctaa 

tgactgagct 

gcggtggtgg 

tttctaattc 

tatataagat 

ttctatttat 

atgtatttaa 

tcacggcaca 

cagagagggc 



aagccctggt 

ccctttctct 

tgtgctcact 

aagtcaaatt 

aaccagggat 

gaaaggagac 

actctttaaa 

ctcccttcag 

ctgtctctgc 

agttcctaga 

cgttaccatg 

ttggttggta 

gatgtatttc 

cccagcccaa 

ggtcaacctc 

aaacacatgc 

cacacataca 

cccccactaa 

ccttttccag 

ttcccctgcc 

ggaccagttt 

acctctctgg 

cctgcaaacc 

atgtcctaga 

cccttctcct 

agtttggggt 

gctttgctcc 

aatcacactc 

gtgtctgggt 

tggtggtggt 

ttatacaaag 

ataaagatga 

acacccttat 

aaaaaaattc 

agtatggagt 

ccaagacaaa 



cctctacggg 
ttttcttaga 
attcataggg 
taaaagggaa 
ggcggtctaa 
ccggaggtaa 
atgtcatcca 
gaaaagattc 
ccacnagtcc 
aagcagcacc 
gccaccaagg 
atatigcttgc 
cagcacctgg 
tt agtaatca 
ttgtctccac 
acacacctga 
tacatacacc 
ggcttcaact 
caaaggcttt 
ct tttctttc 
attcacccac 
tgctgtaccc 
ttcagcctag 
aatctccttc 
gacctctaat 
acctcagagt 
ccggattctg 
cagctgggtt 
ggcegtggtg 
ggtggtggtg 
gacaaatctg 
ctttatgcca 
acatggtatt 
cccagtcctt 
cggagtgtgg 
tgcctcgcct 



ctgggggaac 
aaccagtctc 
acttatccac 
cgtctttcta 
gttacatgct 
aacgaagttg 
tacctgggat 
cacctccggt: 
tcatccatcc 
gcagtcttgg 
tgtcatttaa 
tgcaaaatcg 
aaggcaggga 
gtgagagacc 
acacacaaat 
acaccctcca 
aacattccct 
tcttctattn 
tctttaaatc 
tctctaggga 
ccacccccgc 
tggaccctgg 
cctccggttc 
attctctcca 
gccttggtca 
cagcagggga 
acaggcagtt 
ctgaggcagc 
ctggtggtgc 
tgtgtgtgtg 
cctcatatag 
cttattagca 
gcc tttgttg 
aaaaggtgaa 
agtcctgcca 
cctaaggaac 



32460 

32520 

32580 

32640 

32700 

32760 

32820 

32880 

32940 

33000 

33060 

33120 

33180 

33240 

33300 

33360 

33420 

33480 

33540 

33600 

33660 

33720 

33780 

33840 

33900 

33960 

34020 

34080 

34140 

34200 

34260 

34320 

34380 

34440 

34500 

34560 
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actggcagcc 


ccgatgaggt 


accagagatt 


gctaagtgga 


ggaatacagg 


atcagaccca 


34620 




tggaggggct 


taaagcgtga 


ctgtagcacc 


cctccgctga 


ggggctccag 


gtgggcgccc 


34680 




aaggtgctgc 


agtcggagcc 


acatgagacg 


tgatgtcttg 


gagtcacctc 


gggtaccaut 


34740 




gtttagggag 


gcggggattt 


gtggtgtgga 


gacaggcagc 


ctcaaggatg 


cttt tcaaca 


34800 


5 


atggttgatg 


agtcggaact 


aaaacaggcg 


ccatcacact 


ggctcccata 


gccctgggct 


34860 




tgccagcttc 


cacatctgcc 


ccccacc ccc 


tgtctggcac 


cagctcaagc 


tctgtgat zc 


34920 




tacacatcca 


aaagaggaag 


agtagcctac 


tgggcatgcc 


acctcttctg 


gaccatcagg 


34980 




tgagagtgtg 


gcaagcccta 


ggctcctgtc 


caggatgcag 


ggctgccaga 


taggatgcEC 


35040 




agccatcccc 


tgagctggaa 


ctattttagg 


aataaggatt 


atgcccgccc 


ggggttggcc 


35100 


10 


agcaccccag 


cagcctgtgc 


ttgcgtaaaa 


gcaagtgctg 


ttgatttatc 


taaaaacaga 


35160 




gccgtggacc 


cacccacagg 


acaagtatct 


acgcacctgt 


ttcatgtatc 


tgaaaagcga 


35220 




cacaaccatt 


tttcacatca 


tggcatcttc 


ctaaccccca 


ttcttttttg 


ttttgt tttt 


35280 




ttgagacagg 


gtttctctgt 


gtagtcctcg 


ctgtcctgga 


actcactttg 


tagaccaggc 


35340 




tggcctcgaa 


ctcagaaatc 


ctgggattaa 


a 99tgtgtgc 


caccecgccc 


ggccctaacc 


35400 


15 


cccattctta 


scggtgatcc 


agtggttgaa 


atttcgggcc 


acacacatgt 


ccattaggga 


' 35460 




ttagctgctg 


tcttctgagc 


tacctggtac 


aatctttatc 


ccctggggcc 


tgggctcctg 


35520 




atccctgact 


cgggcccgat 


caagtccact 


tcctgggccc 


gatcaagtcc 


agttcctggg 


35580 




cccgaacaag 


tccagtccct 


agctcgatta 


gctcatcctg 


gctccctggc 


ctgttcttac 


35640 




ttacactctt 


ccccttgctc 


tggacttgtt 


gctttcttta 


ctcaagttgt 


ctgccacagt 


35700 


20 


ccctaagcca 


cctctgtaag 


acaactaaga 


taatacttcc 


ctcaagcacg 


gaaagtcctg 


35760 




agtcaccaca 


ccctctggag 


gtgtgtggac 


acatgttcat 


gcctgtggtt 


gcgcttacgt 


35820 




acgtgtgc 












: 35828 




<2 10> 19 












25 


<211> 9301 














<2 1 2 > DMA 














<212 


> > Homo sapien 












<400> 19 












30 


tagaggagaa 


gtctttgagg 


agggtttgct 


ctgagcacac 


ccctttccct 


ccctccgggg 


60 




ctgagggaaa 


catggcacca 


gccctgcccc 


accctgtcct 


cattggctgg 


catgaagcag 


120 




agaggggctt 


taaaaaggcg 


accgtgcccc 


gcctggagac 


cagagcctgt 


gctactggaa 


180 




ggtggcgtgc 


cctcctctgg 


ctggtaccat 


gcagctccca 


ctggccctigt 


gtctcgtctg 


240 




cctgctggta 


cacacagcct 


tccgtgcagc 


ggsgggccag 


gggtggcagg 


cgttcaagaa 


300 


35 


tgatgccacg 


gaaatcatcc 


ccgagcttcg 


acagtacccc 


gagcctccac 


cggagctgga 


360 




gaacaacaag 


accatgaacc 


gggcggagaa 


cggagggcgg 


cctccccacc 


acccctttga 


420 
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gaccaaaggt atggggtgga ggagagaatt cttagtaaaa gatcctgggg aggttttaga 480 

aacttctctt tggcaggctt ggaagactgg ggtagaccca gtgaagattg ctggcctctg 540 

ccagcactgg tcgaggaaca gtcttgcctg gaggtggggg aagaatggct cgctggtgca 600 

gccttcaaat tcaggtgcag aggcatgagg caacagacgc tggtgagagc ccagggcagg 660 

5 gaggacgctg gggtggtgag ggtatggcat cagggcatca gaacaggctc aggggctcag 720 

aaaagaaaag gtttcaaaga atctcctcct gggaatatag gagccacgtc cagctgctgg 780 

taccactggg aagggaacaa ggtaagggag cctcccatcc acagaacagc acctgtgggg 840 

caccggacac tctatgctgg tggtggctgt ccccaccaca cagacccaca tcatggaatc 900 

cccaggaggt gaacccccag ctcgaagggg aagaaacagg ttccaggcac tcagtaactt 960 

10 . ggtagtgaga agagctgagg cgtgaacctg g.tttgatcca actgcaagat agccctggtg 1020 

tgtggggggg tgtgggggac agatctccac aaagcagtgg ggaggaaggc cagagaggca 1080 

cccci_gcagt gtgcattgcc catggcctgc ccagggagct ggcacttgaa ggaatgggag 1140 

ttttcggcac agttttagcc cctgacatgg gtgcagctga gtccaggccc tggaggggag 1200 

agcagcatcc tctgtgcagg agtagggaca tctgtcccca gcagccaccc cagtcccaac 1260 

15 cttgcctcat tccaggggag ggagaaggaa gaggaaccct gggttcctgg tcaggcctgc 1320 

acagagaagc ccaggtgaca gtgtgcatct ggctctataa ttggcaggaa tcctgaggcc 1380 

a t-99999 c 9t ctgaaatgac acttcagact aagagcttcc ctgtcctctg gccattatcc 1440 

aggtggcaga gaagtccact gcccaggctc ctggacccca gccctccccg cctcacaacc 1500 

tgttgggact atggggtgct aaaaagggca actgcatggg aggccagcca ggaccctccg 1560 

20 tcttcaaaat ggaggacaag ggcgcctccc cccacagctc cccttctagg caaggtcagc 1620 

tgggctccag cgactgcctg aagggctgta aggaacccaa acacaaaatg tccaccttgc 1680 

tggactccca cgagaggcca cagcccctga ggaagccaca tgctcaaaac aaagtcatga 1740 

tctgcagagg aagtgcctgg cctaggggcg ctattctcga aaagccgcaa aatgccccct 1800 

tccctgggca aatgcccccc tgaccacaca c^cattccag ccctgcagag gtgaggatgc 1860 

25 aaaccagccc acagaccaga aagcagcccc agacgatggc agnggccaca tctcccctgc 1920 

tgtgcttgct cttcagagtg ggggtggggg gtggccttct ctgccccctc tctggtttgg 1980 

tcttaagact atttttcatt ctttcttgtc acattggaac tacccccatg aaacctttgg 2040 

gggtggactg gtactcacac gacgaccagc tatttaaaaa gctcccaccc atctaagtcc 2100 

accataggag acatggtcaa ggtgtgtgca ggggatcagg ccaggcctcg gagcccaatc 2160 

30 tctgcctgcc cagggagtat caccatgagg cgcccatztca gataacacag aacaagaaat 2220 

gtgcccagca gagagccagg tcaatgtttg tggcagctga acctgtaggt tttgggtcag 2280 

agctcagggc ccctatggta ggaaagtaac gacagtaaaa agcagccccc agctccatcc 2340 

cccagcccag cctcccatgg atgctcgaac gcagagcctc cactcttgcc ggagccaaaa 2400 

ggtgctggga ccccagggaa gtggagtccg gagatgcagc ccagccttcc gggcaagctc 2460 

35 tt ttctctgg ctgggcctca gtattcccat tgataatgag ggggttggac acactgcctzt 2520 

tgattccttt caagtctaat gaattcctgt cctgatcacc tccccttcag tccctcgccc 2580 
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ccacagcagc 


tgccctgatt 


tattaccttc 




gtccacccct 


cccaagtggc 


tggaaaagga 




tgtgctgagt 


acttaccccg 


cceaggccag 




ataagaagac 


cccagaagag 


aaatgataat 


5 


atatgtgcca 


aatggtattt 


tctgcattgc 




ggcggcccat 


tttgcagaca 


ggaagaagag 




ctgcagtgag 


cgatggagcc 


ctggtgtttg 




cagggtgcgc 


aggagagctt 


tccaccagct 




tgttcagggg 


caaaagcctc 


tggagacagg 


10 


gacgggccgg 


tccagggcag 


999tggccag 




ccacagacgt 


gtccgagtac 


agctgccgcg 




ggccgtgccg 


cagcgccaag 


ccggtcaccg 




cgcgcctgcc 


gcccaacgcc 


atcggccgcg 




tccgctgcat 


ccccgaccgc 


taccgcgcgc 


15 


aggcgccgcg 


cgcgcgcaag 


gtgcgcctgg 




gcttccacaa 


ccagtcggag 


ctcaaggact 




gccggaagcc 


gcggccccgc 


gcccggagcg 




cctactagag 


cccgcccgcg 


cccctcccca 




cccacatttc 


tgtcctctgc 


gcgtggtttg 


20 


acccagggca 


gggggctgag 


accttccagg 




cccctcagcc 


cgccagctga 


ggggtcccac 




cactgagcca 


cgcagccccg 


cctctggggc 




aggcagaaat 


ggaagcattt 


tcaccgccct 




aaagtccagg 


gactggttaa 


gaaagttgga 


25 


cagaaagcct 


gaggcgtgcc 


cagagcacaa 




gtcctggctc 


tgccactaac 


ctgctgcgca 




ctcaatttcc 


actttgtaaa 


atgagggtgg 




ggcatatgat 


tccaaggact 


ccagtgcctt 




gaaagagaga 


gaatgaatgc 


agttgcattg 


30 


gagttgtgat 


gctctcttct 


gacagccaaa 




agagtctatt 


tatcgctgac 


atatttacgg' 




ttcccagcct 


ggcctccccg 


gacgtttggc 




catcatccat 


tggcgtagaa 


aaggagaggg 




atccgcccca 


acttcccaaa 


gagcagcacc 


35 


tcaccttccg 


aagagaagtg 


aaaggttcaa 




gccatcacaa 


actcacagac 


cagcacatcc 



aattaacctc 


tactcctttc 


tccatcccct 


2640 


atttgggaga 


agccagagcc 


aggcagaagg 


2700 


ggaccctgcg 


gcacaagtgt 


ggcttaaatc 


2760 


aataatacat 


aacag.ccgac 


gctttcagct 


2820 


gtgtgtaatg gattaactcg 


caatgcttgg 


2880 


agaggttaag 


gaacttgccc 


aagacgacac 


2940 


aaccccagca 


gtcatttggc 


tccgagggga 


3000 


ctagagcatc 


tgggaccttc 


ctgcaataga 


3060 


cttggcaaaa 


gcagggctgg 


ggtggagaga 


3120 


gcgggcggcc 


accctcacgc 


gcgcctctct 


3180 


agctgcactt 


. cacccgctac 


gtgaccgatg 


3240 


agctggtgtg 


ctccggccag 


tgcggcccgg 


3300 


gcaagtggtg 


gcgacctagt 


gggcccgact 


3360 


agcgcgtgca 


gctgctgtgt 


cccggtggtg 


3420 


tggcctcgtg 


caagtgcaag 


cgcctcaccc 


3480 


tcgggaccga 


ggccgctcgg 


ccgcagaagg 


3540 


ccaaagccaa 


ccaggccgag 


ctggagaacg 


3600 


ccggcgggcg 


ccccggccct 


gaacccgcgc 


3660 


attgtttata 


tttcattgta 


aatgcctgca 


3720 


ccctgaggaa 


tcccgggcgc 


cggcaaggcc 


3780 


ggggcagggg 


agggaattga 


gagtcacaga 


3840 


cgcctacctt 


tgctggtccc 


acttcagagg 


3900 


ggggttttaa 


gggagcggtg 


tgggagtggg 


3960 


taagattccc 


ccttgcacct 


cgctgcccat 


4020 


gactgggggc 


aactgtagat 


gtggtttcta 


4080 


accttgaact 


acacaattct 


ccctcgggac 


4140 


aggtgggaat 


aggatctcga 


ggagactatt 


4200 


tcgaatgggc 


agaggtgaga 


gagagagaga 


4260 


attcagtgcc 


aaggtcactt 


ccagaattca 


4320 


gatgaaaaac 


aaacagaaaa 


aaaaaagtaa 


4380 


ctgacaaact 


cctggaagaa 


gctatgctgc 


4440 


tacctccacc 


cctccatctc 


aaagaaataa 


4500 


tccgagggtg gtgggaggga 


tagaaatcac 


4560 


cctcccccga 


cccatagcca 


tgttttaaag 


4620 


ggacactggc 


cttgcaggcc 


cgagggagca 


4680 


cctttgagac 


accgccttct 


gcccaccact 


4740 
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cacggacaca 


tttctgccta 


gaaaacagct 


tcttactgct 


cttacatgtg 


atggcatatc 


4800 




ttacactaaa 


agaatattac 


tgggggaaaa 


actacaagtg 


ctgtacatat 


gctgagaaac 


4860 




tgcagagcat 


aatagctgcc 


acccaaaaat 


ctttttgaaa 


atcatttcca 


gacaacctct 


4920 




tactttctgt 


gtagttttta 


atcgctaaaa 


aaaaaaagtt 


ttaaacagaa 


gcacatgaca 


4980 


5 


tatgaaagcc 


tgcaggactg 


gtcgtttttt 


tggcaattct 


tccacgtggg 


acttgtccac 


504 0 




aagaatgaaa 


gtagtggt tt 


ttaaagagtt 


aagttacata 


tttattttct 


cacttaagtt 


5100 




atttatgcaa 


aagtttttct 


tgtagagaat 


gacaatgtta 


atattgcttt 


atgaattaac 


5160 




agtctgttct 


tccagagtcc 


agagacattg 


ttaataaaga 


caatgaatca 


tgaccgaaag 


5220 




gatgtggtct 


cattttgtca 


accacacatg 


acgtcatttc 


tgtcaaagtt 


gacacccttc 


5280 


10 


tcttggtcac 


tagagctcca 


accttggaca 


cacctttgac 


tgctctctgg 


tggcccttgt 


5340 




ggcaattatg 


tcttcctttg 


aaaagtcatg 


tttatccctt 


cctttccaaa 


cccagaccgc 


5400 




atttcttcac 


ccagggcatg 


gtaataacct 


cagccttgta 


tccttttagc 


agcctcccct 


5460 




ccatgctggc 


ttccaaaatg 


ctgttctcat 


tgtatcactc 


ccctgctcaa 


aagccttcca 


5520 




tagccccccc 


ttgcccagga 


tcaagtgcag 


tttccctatc 


tgacatggga 


ggccttctct 


5580 


15 


gcttgactcc 


cacctcccac 


tccaccaagc 


ttcctactga 


ctccaaatgg 


tcatgcagat 


5640 




ccctgcttcc 


ttagtttgcc 


atccacactt 


agcaccccca 


ataactaatc 


ctctttcttt 


5700 




aggattcaca 


ttacttgtca 


tctcttcccc 


taaccttcca 


gagatgttcc 


aatctcccat 


5760 




gatccctctc 


tcctctgagg 


ttccagcccc 


ttttgtctac 


accactactt 


tggttcctaa 


5820 




ttctgttttc 


catttgacag 


tcattcatgg 


aggaccagcc 


tggccaagtc 


ctgcttagta 


5880 


20 


ctggcataga 


caacacaaag 


ccaagtacaa 


ttcaggacca 


gctcacagga 


aacttcatct 


5940 




tcttcgaagt 


gtggatttga 


tgcctcctgg 


gtagaaatgt 


agcatcttca 


aaagtgggcc 


6000 




agcctcctgc 


acttctctca 


aagtctcg cc 


tccccaaggt 


gtcttaatag 


tgctggatgc 


6060 




tagctgagtt 


agcatcttca 


gatgaagagt 


aaccctaaag 


ttactcttca 


gttgccctaa 


6120 




ggtgggatgg 


tcaactggaa 


agctttaaat 


taagtccagc 


ctaccttggg 


ggaacccacc 


6180 


25 


cccacaaaga 


aagctgaggt 


ccctcctgat 


gacttgtcaa 


tttaactacc 


aataacccac 


6240 




ttgaattaat 


catcatcatc 


aagtctttga 


taggtgtgag 


tgggtatcag 


tggccggtcc 


6300 




cttcctgggg 


ctccagcccc 


cgaggaggcc 


tcagtgagcc. 


cctgcagaaa 


atccatgcat 


6360 




catgagtgtc 


tcagggccca 


gaatatgaga 


gcaggtagga- 


aacagagaca 


tcttccatcc 


6420 




ctgagaggca 


gtgcggtcca 


gtgggtgggg 


acacgggctc 


tgggtcaggt 


ttgtgttgtt 


6480 


30 


tgtttgtttg 


ttttgagaca 


gagtctcgct 


ctattgccca 


ggctggagtg 


cagtgtcaca 


6540 




atctcggctt 


actgcaactt 


ctgccttccc 


ggactcaagt 


gattctcctg 


cctcagcctc 


6600 




cagagtagct 


gggattacag 


gtgcgtgcca 


ccacgcctgg 


ctaatttttg 


tatttttgat 


6660 




agagacgggg 


tttcaccatg 


ttggccaggc 


tagcctcgaa 


ctcttgacct 


caagtgatct 


6720 




gcctgcctcg 


gcctcccaaa 


gtgctgggat 


tacaggcgtg 


agccaccaca 


cccagcccca 


6780 


35 


ggttggtgtt 


taaatctgag 


gagactgaag 


caccaagggg 


ttaaatgttt 


tgcccacagc 


6840 




catacttggg 


ctcagttcct 


tgccctaccc 


ctcacttgag 


ctgcttagaa 


cctggtgggc 


6900 
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acatgggcaa 


taaccaggtc 


acactgtttt 


gtaccaagtg 


ttatgggaat 


ccaagatagg 




agtaatttgc 


tctgtggagg 


ggatgacgga 


tagtggttag 


ggaaagcctc 


acaaagtggg 




tgttgcttag 


agattttcca 


ggtggagaag 


ggggcttcta 


ggcagaaggc 


atagcccaag 




caaagactgc 


aagtgcatgg 


ctgctcatgg 


gtagaagaga 


atccaccatt 


cctcaacatg 


5 


taccgagtcc 


ttgccatgtg 


caaggcaaca 


tggsggtacc 


aggaattcca 


agcaatgtcc 




aaacctaggg 


tctgctttct 


gggacctgaa 


gatacaggat 


ggatcagccc 


aggctgcaat 




cccattacca 


cgagggggaa 


aaaaacctga 


aggctaaatt 


gtaggtcggg 


ttagaggtta 




tttatggaaa 


gttatattct 


acctacatgg 


ggtctataag 


cctggcgcca 


atcagaaaag 




gaacaaacaa 


cagacctagc 


tgggaggggc 


agcattttgt 


tgtagggggc 


ggggcacaug 


It) 


ttctgggggt 


acagccagac 


tcagggcttg 


tattaatagt 


ctgagagtaa 


gacagacaga 




gggatagaag 


gaaataggtc 


cctttctctc 


tctctctctc 


tctctctctc 


actctctci:c 




tctctcacac 


acacacacag 


acacacacac 


acgctctgta 


ggggtctact 


tatgctccaa 




gtacaaatca 


ggccacattt 


acacaaagag 


gtaaaggaaa 


agaacgttgg 


aggagccaca 




ggaccccaaa 


attccctgtt 


ttccttgaat 


caggcaggac 


ttacgcagct 


gggagggtgg 


15 


agagcctgca 


gaagccacct 


gcgagtaagc 


caagttcaga 


gtcacagaca 


ccaaaagcng 




gtgccatgtc 


ccacacccgc 


ccacctccca 


cctgctcctt 


gacacagccc 


tgtgctccac 




aacccggctc 


ccagatcatt 


gattatagct 


ctggggcctg 


caccgtcctt 


cctgccacat 




ccccacccca 


ttcttggaac 


ctgccctctg 


tcttctccct 


tgtccaaggg 


caggcaaggg 




ctcagctatt 


gggcagcttt 


gaccaacagc 


tgaggctcct 


tttgtggctg 


gagatgcagg 


20 


aggcagggga 


atattcctct 


tagtcaatgc 


gaccatgtgc 


ctggtttgcc 


cagggtggtc 




tcgtttacac 


ctgtaggcca 


agcgtaatta 


ttaacagctc 


ccacttctac 


tctaaaaaat 




gacccaatct 


gggcagtaaa 


ttatatggtg 


cccatgctat 


taagagctgc 


aacttgctgg 




gcgtggtggc 


tcacacctgt 


aatcccagta 


ctttgggacg 


tcaaggcggg 


tggatcacct 




gaggtcacga 


gttagsgact 


ggcctggcca 


gcatggcaaa 


accccatctt 


tactaaaaat 


25 


acaaaaatta 


gcaaggcatg 


gtggcatgca 


cctgtaatcc 


caggtactcg 


ggaggctgag 




acaggagaat 


ggcttgaacc 


caggaggcag 


aggttgcagt 


gagccaagat 


tgtgccactg 




ccctccagcc 


ctggcaacag 


agcaagactt 


catctcaaaa 


gaaaaaggat 


actgtcaatc 




actgcaggaa 


gaacccaggt 


aatgaatgag 


gagaagagag 


gggctgagtc 


accatagtgg 




cagcaccgac 


tcctgcagga 


aaggcgagac 


actgggtcat 


gggtactgaa 


gggtgcccrg 


30 


aatgacgtcc 


tgctttagag 


accgaacctg 


agccctgaaa 


gtgcatgcct 


gttcatgggt 




gagagactaa 


attcatcatt 


ccttggcagg 


tactgaatcc 


tttcttacgg 


ctgccctcca 




atgcccaact 


tccctacaat 


tgtctggggt 


gcctaagctt 


ctgcccacca 


agagggccag 




agctggcagc 


gagcagctgc 


aggtaggaga 


gataggtacc 


cataagggag 


gtgggaaaga 




gagatggaag 


gagaggggtg 


cagagcacac 


acctcccctg 


cctgacaact 


tcctgagggc 


35 


tggtcatgcc 


agcagattta 


aggcggaggc 


a 999gagatg 


gggcgggaga 


ggaagtgaaa 




aaggagaggg 


tggggatgga 


gaggaagaga 


999tgatcat 


tcattcattc 


cattgctact 



6960 

7020 

7080 

7140 
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8040 
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gactggatgc cagctgtgag ccaggcacca ccctagctct gggcatgtgg ttgtaatctt 9120 

ggagcctcat ggagctcaca gggagtgctg gcaaggagat ggataatgga cggataacaa 9180 

ataaacactc agtacaatgt ccgggaatgg aaagttctcg aaacaaaaat aaagctggtg 9240 

agcatataga cagccctgaa ggcggccagg ccaggcattt ctgaggaggt ggcatttgag 9300 

5 c 9301 



<2 10> 19 
< 211 > 21 
<212 > DNA 

H) <213> Artificial Sequence 

< 220 > 

<223 > Primer for PCR 

15 <400> 19 

ccggagctgg agaacaacaa g 21 

< 2 10 > 20 
<2ll> 19 

20 <212 > DNA 

<213> Artificial Sequence 



25 



< 220 > 

<223 > PRimer for PCR 
< 4 00 > 20 

gcactggccg gagcacacc 



19 



< 210 > 21 
30 <2ll> 23 

<212> DNA’ 

<213> Artificial Sequence 



35 



< 220 > 

<223> Primer for PCR 
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<400> 21 

aggccaaccg cgagaagatg acc 

< 210 > 22 
5 < 2 11 > 21 

<212> DNA 

<213> Artificial Sequence 
< 220 > 

H) <223> Primer for PCR 

< 4 00 > 22 

gaagtccagg gcgacgtagc a 

15 <2 10 > 23 

<2 11 > 25 
<212 > DNA 

<213> Artificial Sequence 

20 <220 > 

<223 > Primer for PCR 

<4 00 > 23 

aagcttggta ccatgcagct cccac 
25 

<2 10> 24 
<2 11> 50 
. <212> DNA 

<2 13 > Artificial Sequence 
30 

< 220 > 

<223> Primer for PCR 
<400> 24 



23 



21 



25 



35 



aagcttctac ttgtcaccgt cgtccttgta gtcgtaggcg ttctccagct 



50 
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<2 10 > 


25 




<2 1 1 > 


19 




<212 > 


DNA 




<2 13 > 


Artificial 


Sequence 


<22 0> 


<223 > 


Primer for 


PCR 


<4 00 > 


25 





10 gcactggccg gagcacacc 



<2 10 > 26 
<211> 39 
<212> DNA 

15 <213> Artificial Sequence 

< 220 > 

<223 > Primer for PCR 

20 <4 00 > 26 

gtcgtcggat ccatggggtg gcaggcgttc aagaatgat 

<210> 27 
<2ll> 57 

25 <212> DNA 

<213> Artificial Sequence 

<22 0 > 

<22 3> Primer for PCR 
30 

<4 0 0 > 27 

gtcgtcaagc ttctacttgt catcgtcctt gtagtcgtag gcgttctcca gctcggc 



<21 0> 23 
35 <21 1> 29 



19 



39 



57 



<212> DNA 
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<2i3> Artificial Sequence . 



< 220 > 

<223 > Primer for PCR 
<4 00 > 28 

gacttggatc ccaggggtgg caggcgttc 

<210> 29 
<2ll> 29 
<2 12 > DNA 

<213 > Artificial Sequence 



29 



<220 > 

<223 > Primer for PCR 
<4 00 > 29 

agcataagct tctagtaggc gttctccag 

<210> 30 
<2 11> 29 
<2 12> DNA 

<213> Artificial Sequence 
<220 > 

<223> Primer for PCR 
<4 00> 3C 

gacttggatc cgaagggaaa aagaaaggg 

<210> 31 
<211> 29 
<212> DNA 

<213> Artificial Sequence 



29 
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<223> Primer for PCR 
<400> 31 

agcataagct tttaatccaa atcgatgga 

<210> 32 
<21 1 > 33 
< 212 > DNA 

<2 13 > Artificial Sequence 
<22 0 > 

<223> Primer for PCR 
<400> 32 

actacgagct cggccccacc acccatcaac aag 



<210> 33 
<211> 34 
<212 > DNA 

20 <213> Artificial Sequence 

<22 0 > 

<223> Primer for PCR 

25 <4Q0> 33 

acttagaagc tttcagtcct cagccccctc ttcc 

<210> 34 
< 211 > 66 

30 <212> DNA 

<213> Artificial Sequence 

< 220 > 

<223 > Primer for PCR 
35 



29 



33 



34 



<400> 34 
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aatctggatc cataacttcg tatagcatac attatacgaa gttatctgca ggattcgagg 60 

gcccct 66 



<2 10> 35 
5 <2ll> 82 

<212 > DNA 

<213> Artificial Sequence 

<220 > 

10 <223 > Primer for PCR 



<400> 35 

aatctgaatt ccaccggtgt taattaaaca acttcgtata atgtatgcta tacgaagtta 60 

tagatctaga gtcagcttct ga 82 

15 

<2 10 > 36 
<2 11> 62 
<2 12 > DNA 

<213> Artificial Sequence 

20 

< 220 > 

<223 > Primer for PCR 



<400> 36 

25 atttaggtga cactatagaa ctcgagcagc tgaagcttaa ccacatggtg gctcacaacc 60 

ac 62 

<2 1 0 > 37 
<211 > 54 

30 <212 > DNA 

<213 > Artificial Sequence 

< 220 > 

<22 3 > Primer for PCR 



<4 00 > 37 




WO 00/32773 



PCT/US99/27990 



43 

aacgacggcc agtgaatccg taatcatggt catgctgcca ggtggaggag ggca 54 

<210> 38 

<2 11> 31 <212> DNA 

5 <213 > Artificial Sequence 

<220 > 

<223 > Primer for PCR 
10 <4 00> 36 

attaccaccg gtgacacccg cttcctgaca g 31 



<2 10> 39 
<211 > 61 

15 < 2 12 > DNA 

<213> Artificial Sequence 

< 220 > 

<223> Primer for PCR 

20 

<400> 39 

attacttaat taaacatggc gcgccatatg gccggcccct aattgcggcg catcgttaat 60 

t 61 

25 <210> 4 G 

< 2 1 1 > 34 

<212 > DNA 

<2 1 3 > Artificial Sequence 

30 <22 0> 

<223> Primer for PCR 

<400> 4C 

attacggccg gccgcaaagg aattcaagat ctga 34 

35 



<210> 41 
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<2 11 > 34 
<2 12 > DNA 

<213 > Artificial - Sequence 
<2 2 0 > 

<223> Primer for PCR 
<400> 41 



attacggcgc gcccctcaca ggccgcaccc agct 



34 
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